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DESCRIPTION 

RELEASABLE NONVOLATILE MASS LABEL MOLECULES 
BACKGROUND OF THE INVENTION 

The present application is a continuation-in-part application of provisional applications 
Serial No. 60/033^037, filed December 10, 1996, and Serial No. 60/046,719, filed May 16, 1997, 
the entire disclosures of which are incorporated herein by reference without disclaimer. The 
government may own rights in the present invention pursuant to Cooperative Agreement No. 
70NANB5H1029 from the United States Department of Commerce, Advanced Technology 
Program. 

1. Field of the Invention 

The present invention relates generally to the field of chemical analysis. More 
particularly, it concerns a new class of nonvolatile, releasable tag reagents for use in the 
detection and analysis of target molecules e.g., mass spectrometry. 

2. Description of Related Art 

Chemical labels, otherwise known as tags or signal groups, are widely used in chemical 
analysis. Among the types of molecules used are radioactive atoms, fluorescent reagents, 
luminescent reagents, metal-containing compounds, electron-absorbing substances and light 
absorbing compounds. Chemical signal groups can be combined with reactivity groups so that 
they might be covalently attached to the target, the substance being detected. In many cases, 
however, chemical moieties present on the target may interfere with the detection of the signal 
group or not allow for measurement of the signal group in an optimal detection environment. 

Indirect detection of the target is oftentimes, therefore, preferred. For example, the signal 
group may be the product of the degradation of the target or a derivative of the target (Bueht et. 
al 9 1974; Senft, 1985; U.S. Patent 4,650,750; U.S. Patent 4,709,016; U.S. Patent 4,629,689). 
Volatile releasable tag compounds that can be .analyzed using various forms of electron- 
attachment mass spectrometry, often with gas chromatography-mass spectrometry (GC-MS), 
have been described (Wang at, 1996; U.S. Patent 5,360,819; U.S. Patent 5,516,931). Despite 
the broad range of volatile mass labels reported, a transition from liquid to gas phase is required 
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for analysis which places significant synthetic and size parameters on the label. Isotopic mass 
labels have also been described, such as using tin or sulfur isotopes, with various mass 
spectrometric sampling approaches (Arlinghaus et ai 1997; U.S. Patent 5,174,962). The isotopic 
labeling often limits the extent of multiplexing and provides a more complex analysis 
requirement. 

Mass spectral analysis of signal groups involves none of the concerns related to 
radioactive signal groups, such as their short half-lives and their safety and disposal issues. 
Another key advantage to detection of signal groups via mass spectrometry is that it allows a 
great ability to multiplex, to detect for more than one signal group in a complex mixture, and 
therefore more than one target at a time. Brurnmei ei ui. (1994; 1996) have demonstrated the use 
of mass spectrometry in the direct analysis of combinatorial libraries of small peptides. 
However, use of this technology is limited to analysis of the entire reacting compound by mass 
spectrometry. 

Detection of multiple fluorescent labels has been used to analyze nucleic acid sequences. 
Nucleic acid hybridization probes are modified to contain fluorescent chromophores that when 
excited by light emit a unique color spectrum signature. In fluorescence based sequencing 
systems, four different chromophores can be multiplexed within a sample and individually 
detected with the aid of software deconvolution. The practical upper limit for fluorescence 
multiplexing is likely to be around 10 different labels due to the broad overlapping spectrum 
produced by existing fluorescent chromophores. Clearly the development of nonvolatile 
releasable mass labels, detectable over the usable range of a mass spectrometer, would represent 
a significant advantage by permitting the multiplexing of tens, hundreds and perhaps even 
thousands of different mass labels that can be used to uniquely identify each desired target. 

At present, while tools are available through which target molecules may be detected, 
there remains a need for further development of these systems in order to analyze a large number 
of targets simultaneously. This will allow for the systematic analysis of target molecules with 
predetermined properties and functions. 
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SUMMARY OF THE INVENTION 



It is, therefore, a goal of the present invention to provide compositions and methods 
relating to the use of release tag compounds for detection and analysis of target molecules. 

The present invention relates to the use of nonvolatile, releasable tag compounds, 
containing releasable mass labels, in chemical analysis, and to the use of these reagents in 
conjunction with probes which react with or bind noncovalently to a molecule whose presence is 
to be detected. The releasable tag reagents thus may indirectly detect target molecules, including 
biomolecular targets. The mass label may be released from the probe following reaction with or 
binding of the probe to the target and detected by mass spectrometry. The mass value of the 
label identifies and characterizes the probe and, therefore, the target molecule. In the case of a 
mass-labeled oligonucleotide probe used to target a polynucleotide, the detection of mass-labels 
rather than the nucleic acid probes or the nucleic acid targets themselves means that biochemical 
analysis procedures can be greatly simplified. The need for slow, laborious, costly, and/or 
complex solid-phase and/or solution-phase cleanup and desalting procedures can be minimized 
or even eliminated. 

Therefore, in accordance with the present invention, there is provided a release tag 
compound comprising Rx, Re and M, wherein Rx is a reactive group, Re is a release group, and 
M is a mass label detectable by mass spectrometry. As used herein the term "a" encompasses 
embodiments wherein it refers to a single element as well as embodiments including one or more 
of such elements. For example, the phrase "a reactive group" may. refer to a single reactive 
group, but also encompasses embodiments including more than one reactive group. 

Although the mass label may typically be a synthetic polymer or a biopolymer or some 
combination thereof, in some embodiments, the mass label may generally be any compound that 
may be detected by mass spectrometry. In particular embodiments, the mass label may be a 
biopolymer comprising monomer units, wherein each monomer unit is separately and 
independently selected from the group consisting essentially of an amino acid, a nucleic acid, 
and a saccharide with amino acids and nucleic acids being preferred monomer units. Because 
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each monomer unit may be separately and independently selected, biopolymer mass labels may 
be polynucleic acids, peptides, peptide nucleic acids, oligonucleotides, and so on. 

As defined herein "nucleic acids" refer to standard or naturally-occurring as well as 
modified/non-natural nucleic acids, often known as nucleic acid mimics. Thus, the term 
"nucleotides" refer to both naturally-occurring and modi fie d/nonnaturally-occurring nucleotides, 
including nucleoside tri, di, and monophosphates as well as monophosphate monomers present 
within polynucleic acid or oligonucleotide. A nucleotide may also be a ribo; 2*-deoxy; 2\ 3'- 
deoxy as well as a vast array of other nucleotide mimics that are well-known in the art. Mimics 
include . chain-terminating nucleotides, such as 3'-0-methyl, halogenated base or sugar 
substitutions; alternative sugar structures including nonsugar, alkyi ring structures; alternative 
bases including inosine; deaza-modified; chi, and psi, linker-modified; mass label-modified; 
phosphodiester modifications or replacements including phosphorothioate, methylphosphonate, 
boranophosphate, amide, ester, ether; and a basic or complete internucleotide replacements, 
including cleavage linkages such a photocleavable nitrophenyl moieties. These modifications 
are well known by those of skill in the art and based on fundamental principles as described 
Saenger (1983), incorporated herein by reference. 

Similarly, the term "amino acid" refers to naturally-occurring amino acid as well as any 
modified amino acid that may be synthesized or obtained by methods that are well known in the 
art. 

In another embodiment, the mass label may be a synthetic polymer, such as polyethylene 
glycol, polyvinyl phenol, polyproplene glycol, polymethyl methacrylate, and derivatives thereof. 
Synthetic polymers may typically contain monomer units selected from the group consisting 
essentially of ethylene glycol, vinyl phenol, propylene glycol, methyl methacrylate, and 
derivatives thereof. More typically the mass label may be a polymer containing polyethylene 
glycol units. 

The mass label is typically detectable by a method of mass spectrometry. While it is 
envisioned that any known mass spectometry method may be used to detect the mass labels of 
the present invention, methods such as matrix-assisted laser-desorption ionization mass 
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spectrometry, direct laser-desorption ionization mass spectrometry (with no matrix), electrospray 
ionization mass spectrometry, secondary neutral mass spectrometry, and secondary ion mass 
spectrometry are preferred. 

In certain embodiments the mass label has a molecular weight greater than about 500 
Daltons. For some embodiments, it may be preferred to have nonvolatile (including involatile) 
mass labels, however, for other embodiments volatile mass labels are also contemplated. 

As defined herein, the term "reactive group" refers to a group capable of reacting with the 
molecule whose presence is to be detected. For example, the reactive group may be a 
biomolecule capable of specific molecular recognition. Biomoiecuies capable of specific 
molecular recognition may typically be any molecule capable of specific binding interactions 
with unique molecules or classes of molecules, such as peptides, proteins, polynucleic acids, etc. 

Thus, reactive groups disclosed herein for use with the disclosed methods encompass 
polypeptides and polynucleic acids. As used herein, polypeptides refer to molecules containing 
more than one amino acid (which include native and non-native amino acid monomers. Thus, 
polypeptides includes peptides comprising 2 or more amino acids; native proteins; enzymes; 
gene products; antibodies; protein conjugates; mutant or polymorphic polypeptides; post- 
translationally modified proteins; genetically engineered gene products including products of 
chemical synthesis, in vitro translation, cell-based expression systems, including fast evolution 
systems, involving vector shuffling, random or directed mutagenesis, and peptide sequence 
randomization. In preferred embodiments polypeptides may be oligopeptides, antibodies, 
enzymes, receptors, regulatory proteins, nucleic acid-binding proteins, hormones, or protein 
product of a display method, such as a phage display method or a bacterial display method. 
More preferred polypeptide reactive groups are antibodies and enzymes. As used herein, the 
phrase "product of a display method" refers to any polypeptide resulting from the performance of 
a display method which are well known. in the art. It is contemplated that any display method 
known in the art may be used to produce the polypeptides for use in conjunction with the present 
invention." 
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Similarly, "polynucleic acids" refer to molecules containing more than one nucleic acid. 
Polynucleic acids include lengths of 2 or more nucleotide monomers and encompass nucleic 
acids, oligonucleotides, oligos, polynucleotides, DNA, genomic DNA, mitochondrial DNA 
(mtDNA), copy DNA (cDNA), bacterial DNA, viral DNA, viral RNA, RNA, message RNA 
(mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), catalytic RNA, clones, plasmids, Ml 3, 
PI, cosmid, bacteria artificial chromosome (BAC), yeast artificial chromosome (YAC), 
amplified nucleic acid, amplicon, PCR product and other types of amplified nucleic acid. In 
preferred embodiments, the polynucleic acid may be an oligonucleotide. 

In still further embodiments, Rx is an oligonucleotide having one or more nucleotides or 
oligonucleotide is added after hybridization of Rx to a cofnplemeniary nucleic acid sequence. 
The term complementary generally refers to the formation of sufficient hydrogen bonding 
between two nucleic acids to stabilize a double-stranded nucleotide sequence formed by 
hybridization of the two nucleic acids. 

Typically, nucleotides may be added by a polymerase while oligonucleotides may be 
added by a ligase. However, it is also contemplated that other methods of adding nucleotides 
and oligonucleotides known by those of skill in the art may also be employed. In further 
embodiments, it is provided that the nucleotide added after hybridization may have a chain 
terminating modification, for example, the added nucleotide may be a chain terminating dideoxy 
nucleotide. 

Embodiments are also, provided wherein the -added nucleotide or oligonucleotide further 
comprise a functional group capable of being immobilized on a solid support, for example, a 
biotin or digoxigenin. Generally, this functional group or binding group or moiety is capable of 
attaching or binding the tag compound to the solid support. This binding moiety may be 
attached to the added nucleotide or oligonucleotide directly through an intervening linking group 
or by specific hybridization to an intermediary oligonucleotide which is itself bound to a solid 
support. Binding moieties include functional groups for covalent bonding to a solid support, 
ligands that attach to the solid support via a high-affinity, noncovalent interaction (such as biotin 
with streptavidin). a series of bases complementary to an intermediary oligonucleotide which is 
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itself attached to the solid support, as well as other means that are well-known to those of skill in 
. the art, such as those described in PCT WO 96/37630, incorporated herein by reference. 

In other embodiments, the reactive group may contain a nuclease blocking moiety. These 
5 moieties serve to block the digestion of the oligonucleotide by the nuclease, such as an 
exonuclease. Typical nuclease blocking moieties thus include phosphorothioate, 
alkylsilyldiester, boranophosphate, methylphosphonate, and peptide nucleic acid. 

The mass label is linked, or attached, to the reactive group via a releasable attachment. 
10 Thus, typically the mass label is released from all or a part of the reactive group prior to mass 
spectral analysis as contemplated by the various methods described herein. This releasable 
attachment typically occurs through the use of a release group which may be the linkage between 
the mass label and the reactive group or which may comprise a portion or all of the reactive 
group or which may be contained within the reactive group. 

15 

The release group may be any labile group providing for such a releasable attachment. 
The release group may thus be a chemically cleavable linkage or labile chemical linkage. Such 
linkages may typically be cleaved by methods that are well known to those of skill in the art, 
such as by acid, base, oxidation, reduction, heat, light, or metal ion catalyzed, displacement or 

20 elimination chemistry. In a particular embodiment, the chemically cleavable linkage comprises 
a modified base, a modified sugar, a disulfide bond, a chemically cleavable group incorporated 
into the phosphate backbone, or a chemically cleavable linker. Some examples of these linkages 
are described in PCT WO 96/37630, incorporated herein by reference. As used herein, 
"chemically cleavable linkers" are moieties cleavable by, for example, acid, base, oxidation, 

25 reduction, heat, light, metal ion catalyzed, displacement or elimination chemistry. 

Chemically cleavable groups that may be incorporated into the phosphate backbone are 
well known to those of skill in the art and may include dialkoxysilane, 3'-(S)-phosphorothioate, 
5'-(S)-phosphorothioate, 3'-(N)-phosphoroamidate, or 5'-(N)-phosphoroamidate. In further 
30 embodiments the chemically cleavable linkage may be a modified sugar, such as ribose. 
Alternatively, the linkage may be a disulfide bond. 
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o 

In still yet another embodiment, Re is contained within Rx. In this case, the release of Re 
may be activated by a selective event. In particular embodiments, the selective release is 
mediated by an enzyme such as an exonuclease specific for double-stranded or single-stranded 
DNA. When it is said that Re is contained within Rx, it will generally be understood that the 
reactive group contains within its structure the particular release group which will cause the mass 
label to disconnect from the tag compound in that particular embodiment. 

Thus, release groups encompassed by the invention also include groups or linkages 
cleavable by an enzyme. Enzymatically-cleavable release groups include phosphodiester or 
amide linkages as well as restriction endonuclease recognition sites. 

Preferred embodiments encompass release groups cleavable by nucleases. These 
nucleases may typically be an exonuclease or a restriction endonuclease. Typical exonucleases 
include exonucleases specific for both double-stranded and single-stranded polynucleic acids. 
Additionally, restriction endonucleases encompassed by certain embodiments include Type IIS 
and Type II restriction endonucleases. 

In other embodiments the release group may be cleavable by a protease. Typical 
proteases include endoproteinases. 

Also provided are embodiments wherein Rx comprises a nucleoside triphosphate or is 
synthesized using mass-labeled nucleoside triphosphates. In another embodiment, Rx comprises 
a nucleoside phosphoramidite or is synthesized using mass-labeled nucleoside phosphoramidites. 

In still further embodiments, mass-labeled probes are provided wherein at least one 
component is a nucleoside triphosphate. It is further contemplated that the labeled probes of the 
invention may include at least two unique mass-labels are incorporated. 

Also provided are release tag compounds comprising Rx, Re and M, wherein Rx is a 
double-stranded oligonucleotide comprising a restriction endonuclease recognition site; Re is a 
release group comprising a phosphodiester linkage capable of being cleaved by a restriction 
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endonuclease; and M is a mass label detectable by mass spectrometry. Rx may further include a 
modified nucleotide and the mass label may include a portion of Rx. 



Double-stranded oligonucleotides as provided herein include not only two 
complementary strands hybridized to each other via hydrogen bonding interactions, but also 
include single strands of nucleotides wherein portions of the strand are single-stranded and 
portions are double-stranded. For example, portions or all of Rx may include a self- 
complementary oligonucleotide hairpin where part of Rx is complementary to another part of Rx. 
In this case, certain conditions allow the formation of a double-stranded duplex between these 
two portions of Rx. For purposes of certain embodiments of the present invention, it is not 
necessary that all of Rx need be double-stranded, release tag compounds containing single- 
stranded regions are also contemplated as being within this embodiment. 

Release tag compound are also contemplated having Rx, Re and M, wherein: Rx is a 
double-stranded oligonucleotide; Re is a chemically cleavable release group; and M is a mass 
label detectable by mass spectrometry. In this embodiment, Re is typically located within Rx. 
Cleavage at the chemically cleavable release group is generally inhibited in this aspect by the 
presence of a double-stranded oligonucleotide at the release group. Previously discussed 
chemically cleavable release groups, such as 3'-(S)-phosphorothioate, 5'-(S)-phosphorothioate, 
3'-(N)-phosphoroarnidate, 5'-(N)-phosphoroamidate, or ribose, may be employed with these 
embodiments. In these embodiments, a portion of Rx may be rendered single-stranded at Re by 
hybridization of a portion of Rx to a target nucleic acid. 

Also provides is a set of release tag compounds for detecting a particular target nucleic 
acid. In this aspect, the target nucleic acid typically contains more than one release tag 
compound. Each release tag compound includes the elements Rx, Re and M, where Rx is an 
oligonucleotide including a variable region and an invariant region; Re is a release group; and M 
is a mass label detectable by mass spectrometry. The invariant and variable regions react with 
the target nucleic acid. It will generally be understood by those of skill in the art that the term 
"set" refers to a group of two or more release tag compounds. Generallyeach member, i.e., each 
release tag compound of the group will be different from all other members of the group. That 
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is, each member will include a different combination of reactive group release group and mass 
label. 

Typically, the mass label of at least one member of the set may identify a specific 
sequence within the variable region. In some embodiments, the mass label for each member of 
the set may uniquely identify each different sequence within the variable region. In other 
embodiments, a combination of the mass labels of two or more release tag compounds may 
identify each different sequence within the variable region. 

As previously discussed, Rx may further comprise a nucleotide or oligonucleotide added 
after hybridization to the target nucleic acid. In this aspect, the added nucleotide or 
oligonucleotide may further comprise Re' and M', where Re' is a release group; and M* is a mass 
label detectable by mass spectrometry. The added nucleotide or oligonucleotide may also 
contain a chain terminating moiety or a functional group capable of being immobilized on a solid 
support, such as biotin or digoxigenin. 

Methods of producing a mass-labeled probe are provided, comprising combining 
nucleoside or amino acid monomers with at least one mass-labeled monomer under conditions to 
allow for polymerization. 

Further embodiments are provided wherein the polymerization is mediated by an enzyme. 
Still further embodiments are provided wherein the polymerization is mediated by chemical 
synthesis. The preferred synthetic methods to prepare the compound of the present invention are 
essentially those for standard peptide and DNA synthesis. 

For particular embodiments, synthesis in the solid phase is preferred to allow for a wide 
variety of compounds to be produced using combinatorial methods. 

Additional embodiments are provided for a method of producing a mass-labeled probe, 
comprising the steps of (a) combining nucleoside .monomers with at least one activated 
nucleoside monomer under conditions to allow for polymerization; and (b) adding a releasable, 
nonvolatile mass unit to said activated nucleoside monomer. 
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The present invention also provides embodiments which provide a method for detecting a 
target molecule. Generally, the method includes obtaining a plurality of probes, each probe 
including a reactive group, a release group and a mass label, as described. It is preferred that 
5 each probe within the plurality contains a unique mass-label. By "unique mass label" it is meant 
that each probe within the plurality will have a different mass label from all other probes in the 
plurality. A plurality will generally be understood to include two or more probes. Next, the 
target molecule is contacted with the plurality of probes under conditions suitable to allow for the 
formation of probe: target molecule complexes. The mass-label is released from the probe and 
10 the mass of the mass-label is determined. Typically, the mass is indicative of a specific target 
molecule, in this way, the target molecule can be identified according to the unique combination 
of mass-labels. 

In another aspect, the invention provides a method for detecting a target molecule where 
15 ■ the target molecule is amplified to produce an amplified target molecule. The amplified target 
molecule is then hybridized with a probe such as those described hereinabove to produce probe: 
amplified target molecule complexes. The mass label on the probe amplified target molecule 
complexes are then released and the mass of the mass label determined by mass spectrometry. 

20 The target nucleic acid may be amplified by any method known by one of skill in the art, 

for example, polymerase chain reaction ("PCR"), with PCR being a preferred amplification 
method. The amplification may include a functional group capable of being immobilized on a 
solid support, such as biotin or digoxigenin. This functional group may be attached to an 
oligonucleotide primer incorporated into the amplified molecule during the amplification step or 

25 it may be attached to a nucleotide incorporated into the amplified target molecule during the 
amplification step. 

Methods are also provided wherein the amplified target molecule is immobilized onto a 
solid support and any probe not part of a probe:amplified target molecule complex is removed by 
30 washing. It will be understood by those of skill in the art that the nature of the recognition of the 
target molecule by the reactive group will depend on the identity of the target molecule and the 
reactive group. For purposes of exemplification and not limitation, this recognition may 



WO 98/26095 PCT/US97/22639 

encompass the formation of a double-stranded duplex by hybridization where the reactive group 
and target molecule are oligonucleotides. The mass label may be released enzymatically or 
chemically. 

It is contemplated that useful enzymes for this embodiment will include nucleases, such 
as Type II and IIS restriction endonuclease and exonucleases. The envisioned exonucleases may 
be specific for double-stranded DNA, such as exonuclease III, T4 endonuclease VII, lambda 
exonuclease, and DNA polymerase. For these embodiments the release of the mass label may 
be triggered by the hybridization of the probe to the amplification product. In that embodiment 
the probe would be single-stranded and capable of hybridizing to the target whose presence was 
to be detected. The exonuclease may also be specific for singie-stranded DNA. 

Chemically cleavable linkages may comprise a modified base, a modified sugar, a 
disulfide bond, a chemically cleavable group incorporated into the phosphate backbone, or a 
chemically cleavable linker and are typically cleaved by acid, base, oxidation, reduction, heat, 
light, or metal ion catalyzed, displacement or elimination chemistry. 

Embodiments are provided wherein the reactive group further comprises a nucleotide or 
oligonucleotide added after hybridization to the amplification product, amplified target molecule 
or amplified nucleic acid molecule. These added nucleotides or oligonucleotides may optionally 
include a functional group capable of being immobilized on a solid support. 

For embodiments employing immobilization onto a. solid support, one will typically 
immobilize the reactive group onto the solid support after addition of the nucleotide or 
oligonucleotide then any probes having unbound reactive groups are removed prior to releasing 
the mass label of any probe belonging to a probe: amplified target molecule complex or 
probe :target molecule complex. 

In these embodiments, the reactive and release groups may be the same or the release 
group may be contained within the reactive group. The probe may also comprise at least two 
unique mass labels. 
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Multiplexing methods are also provided wherein the target molecule is contacted with a 
plurality of probes. In these instances, each reactive group of the probe may be associated with a 
unique mass label or it may be associated with a unique set of mass labels. Thus, a target 
molecule may be detected by the mass spectral detection of a particular mass label or a particular 
set of mass labels. Where a set of mass labels is employed, the set of mass labels may be 
attached to the same probe. Alternatively, each member of the set may be attached to a different 
probe. 

Also provided are methods for detecting mismatches wherein the amplified nucleic acid 
product comprises a double-stranded molecule containing a mismatch, and an exonuclease- 
blocking functionality at the 3 f ends of the strands. Typically, this method may further comprise 
cleavage of at least one strand of the double-stranded molecule at the site of the mismatch; and 
selective releasing of the mass label. Selective releasing of the mass label may typically be 
accomplished by digestion of the cleaved strand by a 3' to 5' exonuclease, such as exonuclease 
III, 

As used herein, the term "selective releasing" comprises to the releasing of a mass label 
from a probe which belongs to a probeitarget molecule complex without releasing a mass label 
from a probe not belonging to such a complex without having to physically partition the two 
types of probes. However, some embodiments may include both selective releasing and physical 
partitioning. The described immobilization and washing techniques exemplify a method of 
physical partitioning. 

The mismatch may be cleaved by an enzyme, such as mutHLS, T4 endonuclease VII, 
mutY DNA glycosylase, thymine mismatch DNA glycosylase, or endonuclease V. The 
mismatch may also be cleaved by a chemical, such as Os0 4 , HONH 2 , or KMn0 4 . 

The invention further provides a method for detecting a target molecule including the 
steps of: (a) obtaining a probe including a reactive group, a release group and a nonvolatile mass 
label; (b) contacting a target molecule with the probe to produce probeitarget molecule 
complexes; (c) the selectively releasing the mass label from the probertarget molecule complexes 
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to produce released mass labels; and (d) determining the mass of the released mass labels by 
mass spectrometry. 

Typically, similar chemical and enzymatic release methods may be employed with these 
embodiments. Selective release of the mass label may also be accomplished by employing 
cleavage means that are inhibited by the presence of a double-stranded oligonucleotide at the said 
release group. As used in this context, "at said release group" means that base pairing is 
maintained on both sides of the release group by at least one nucleotide. 

In this embodiment, contacting the probe with the target molecule typically results in the 
release group being present in a single-stranded region because one strand of the prolse interacts 
with the target molecule, for example, by hybridizing to it. 

Another aspect of the invention encompasses a method for multiplexing the detection of a 
target molecule including: (a) obtaining a plurality of probes, each probe including a reactive 
group, a release group and a mass label; (b) contacting the target molecule with the plurality of 
probes to produce probe:target molecule complexes; (c) releasing the mass label from any probe 
belonging to probe:target molecule complexes to produce released mass labels; and (d) 
determining the mass of any released mass label by mass spectrometry. In this aspect, each 
reactive group recognizing a specific target molecule is associated with a unique set of mass 
labels. It may often be preferred that a plurality of target molecules with the plurality of probes. 

The members of the set of mass labels may be attached to the same probe or to different 
probes. Additionally, the same mass label may be a member of sets identifying more than one 
reactive group. Thus, in this embodiment the set of mass labels, and not the individual mass 
label, is unique to a particular reactive group. In this embodiment, probes having a reactive 
group that identifies a particular target may vary in release group and mass label as well as in 
other respects. 

Immobilization and washing techniques may be employed with this embodiment and it 
may be preferred in some embodiments to immobilize a plurality of target molecules onto the 
solid support at spaced locations and to then contact them with the mass-labeled probes. Typical 
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target molecules include a polynucleotide, an antigen, a ligand, a polypeptide, a carbohydrate, 
and a lipid. 

In further embodiments it may be preferred to employ sets of mass labels wherein a mass 
label member of the set represents a particular moiety or functionality or subset of the target 
molecule. For example, mass label A could correspond to a reactive group composed of 
A'X 2 ..-X N functionalities where A can be anywhere in the reactive group and only represents A 1 
and may or may not be structurally related to A' in any way. Thus, detecting mass label results 
in the detection of a target molecule that recognizes A', but does not necessarily identify anything 
else about the structure or composition of the target molecule. 

Thus, methods are provided wherein the unique set of mass labels comprises a mass label 
that indicates the presence of a specified component within the reactive group. Further 
embodiments also include methods wherein the mass label indicates the presence of the specified 
component at a specified location within the reactive group. A reactive group comprising n 
specified components may be associated with a unique set of mass labels having n members 
where n may typically be from 1 to 1000. Generally, mass labels are individually attached to the 
reactive group and are identified intact. 

A reactive group comprising n specified components may also be associated with a 
unique set of mass labels having y members wherein n is less than y!/[x!(y-x)!]; and wherein x 
comprises the number of mass labels per reactive group. 

In some embodiments a plurality of probes may each comprise a known reactive group 
having a known set of mass labels and the plurality of probes may be prepared by combinatorial 
synthesis. The plurality of target molecules may also comprise a known chemical structure. 

Also provided is a method of monitoring gene expression including (a) obtaining a 
plurality of probes, each including a reactive group, a release group and a mass label; (b) 
contacting a plurality of target nucleic acids with the plurality of probes to produce probertarget 
nucleic acid complexes; (c) selectively releasing the mass label from any probe belonging to a 
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probe:target nucleic acid complexes to produce released mass labels; and (d) determining the 
mass of any released mass label by mass spectrometry. 

Typically, the target nucleic acids may have sequences representative of the genes being 
expressed in a particular cell culture and are present in concentrations related to their mRNA 
abundance levels. The target nucleic acids may typically comprise mRNA or first-strand cDNA 
as well as amplified nucleic, acid products. 

Such amplified nucleic acid products may be produced using PCR, rtPCR, LCR, Qbeta 
Replicase, SDA, CPR, TAS, NASBA, or multiple rounds of RNA transcription or some 
■combination -thereof. Amplification may be used to selectively amplify a subset of the mBNA 
pool increasing detection signal for these gene products and reducing background from gene 
products outside of the amplified subset. 

Another embodiment encompasses a method of monitoring gene expression including 
amplifying a subset of an mRNA pool to produce a plurality of amplified nucleic acid products; 
contacting a plurality of amplified nucleic acid products with a plurality of probes, each probe 
including a reactive group, a release group and a mass label to produce probe:amplified nucleic 
acid product complexes selectively releasing the mass label from any probe belonging to a 
proberamplified nucleic acid produce complexes to produce released mass labels determining the 
mass of any released mass label by mass spectrometry. 

For this embodiment, one more probes or amplified nucleic acid products may be 
capable of being immobilized onto a solid support. 

Another aspect of the invention is a method for detecting a target molecule, including 
contacting a target molecule with a probe including a reactive group, a release group and a 
nonvolatile mass label to produce probe:target molecule complexes; releasing the mass label 
from any probe belonging to a complex to produce released mass labels; selectively desorbing 
the released mass label from the mass spectral matrix such that the probes not belonging to 
probe:target molecule complexes do not desorb; and determining the mass of the released mass 
label by mass spectrometry. 
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For these embodiments, the mass label should desorb more efficiently from the mass 
spectral matrix than the probe or the mass-labeled probe. Preferred mass spectral matrices 
include 2,5-dihydroxybenzoic acid, sinapinic acid, or alpha-cyano-4-hydroxycinammic acid. 

A method for detecting a target molecule is also provided. This method includes 
amplifying one or more target nucleic acids to produce amplified nucleic acid products; 
incorporating one or more molecules including a reactive group, a release group and a 
nonvolatile mass label into the amplified nucleic acid product during the amplification process; 
selectively releasing the mass labels incorporated into the amplified nucleic products to produce 
released mass labels; and '-determining the mass of the released mass labels by mass 'Spectrometry. 

Incorporated molecules may be oligonucleotide primers and nucleoside triphosphates and 
the amplified nucleic acid products are produced using PCR, rtPCR, LCR, Qbeta Replicase, 
SDA, CPR, TAS, NASBA, or multiple rounds of RNA transcription or some combination 
thereof. One or more second molecules, each including a functional group capable of being 
immobilized on a solid support, may also be incorporated into the amplified nucleic acid 
products. The functional group may also be used to bind the amplified nucleic acid products to a 
solid support, and separate incorporated mass labeled molecules from unincorporated mass 
labeled molecules. It may also be preferable to separate the amplified nucleic acid products from 
the unincorporated mass labeled molecules, for example, by binding the amplified nucleic acid 
products to a solid support or by hybridizing the amplified nucleic acid products to a 
polynucleotide bound to solid support. In the latter case, the bound polynucleotide may be an 
oligonucleotide, a polyribonucleotide, a plasmid, an Ml 3, a cosmid, a PI clone, a BAC or a 
YAC. A plurality of these polynucleotides may also be immobilized onto the solid support at 
spaced locations. 

Also provided is a method for detecting the presence of a target nucleic acid molecule, 
said method comprising: obtaining a probe comprising a reactive group, a release group and a 
mass label; contacting the probe to a target nucleic acid molecule to produce probe:nucleic acid 
molecule complexes; mass modifying the probe:nucleic acid molecule complexes by attaching a 
nucleotide or oligonucleotide to the probe to produce mass modified mass labels; releasing the 
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mass modified mass labels; and determining the mass of the mass-modified mass labels by mass 
spectrometry. 

Another embodiment encompasses a method for detecting specific biomolecules in an 
enzyme-linked affinity assay comprising: obtaining a substrate; contacting a target molecule with 
an affinity ligand-enzyme conjugate to produce an affinity ligand-enzyme conjugate: target 
molecule complex; contacting the affinity ligand-enzyme conjugate:target molecule complex 
with the substrate to produce a mass modified product; and determining the mass of the mass 
modified product by mass spectrometry. 

As used herein, ''"affinity iigands" are groups, molecules, or moieties having an affinity 
for, or reacting with a particular target molecule, similar to the reactive groups employed with 
the mass label probes disclosed above. The affinity ligand may be a biomolecule capable of 
specific molecular recognition, such as a polypeptide or polynucleic acid. Preferred polypeptides 
include antibodies, enzymes, receptors, regulatory proteins, nucleic acid-binding proteins, 
hormones, and protein products of a display method, such as products of a phage display method 
or a bacterial display method. 

The enzymes conjugated to these affinity Iigands may be any enzyme that catalyze the 
conversion of the substrate to a product having a different mass, such as restriction 
endonucleases and proteases. Thus, the mass of the substrate has been modified in the 
production of the product by the enzyme. Affinity ligand-enzyme conjugates are molecules 
where the affinity ligand and enzyme have been attached by the formation of covalent or 
noncovalent interactions, including hydrogen bonds. 

In some embodiments it may be preferable to employ a plurality of restriction 
endonucleases. In these cases, the various endonucleases may be conjugated to the affinity 
ligand to form several affinity ligand-enzyme conjugates which are then contacted with the target 
molecule. Similarly, it may be preferable to employ a plurality of affinity ligand-enzyme 
conjugates having different affinity Iigands, enzymes, or both. 
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The substrate may be any molecule whose conversion to a mass-modified product is 
accomplished by the enzyme employed such as a polypeptide. For embodiments employing 
restriction endonucleases, it may therefore comprise a restriction site. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
reference to one or more of these drawings in combination with the detailed description of 
specific embodiments presented herein. 

FIG. 1A and FIG. IB show generalized examples of two mass-labeled building blocks 
for the preparation of mass-labeled polynucleotides, a mass-labeled nucleoside triphosphate 
(FIG. 1A) and a mass-labeled nucleoside phosphoramidite (FIG. IB). In these FIGS., B refers 
to a base, R to an optional releasing linkage, and M to a mass label. Mass labels may also be 
added after polynucleotide synthesis via linker reagents. 

FIG 2A and FIG. 2B show examples of a mass-labeled probe where the releasable group 
is contained within the reactive group and the released mass-label includes one or more 
monomers of the reactive group. 

Shown in FIG. 2A is the use of the probe as an oligonucleotide primer that can be 
extended (Step A) by polymerase using nucleoside triphosphates, including deoxy and 
dideoxyribonucleotide or combinations thereof, or by ligase using oligonucleotides. Ligase may 
be used to attach oligonucleotides to the 5' as well as the 3' end. Nucleotides and 
oligonucleotides, added as well as nucleotide monomers within the probe may optionally consist 
of modified nucleotides or non-natural, mimic nucleotides. Also shown is the optional use of a 
solid-phase binding group such as biotin (labeled B) that can be used to capture the extended 
mass-labeled primer prior to release of the mass-label product (Step B). Following release the 
mass-labeled product is analyzed by mass spectrometry (Step C). The non-reactive group 
component of the mass label is indicated by Mx, where the x signifies that this component may 
have a single molecular mass or it may represent a combination of 2 or more molecules of 
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defined mass. The Mx component may be optionally contained fully within the reactive group 
and may be comprised of nucleotides or non-natural, mimic nucleotides. Determining the mass 
of the mass-label product provides the means for identifying the nucleotide composition and 
sequence of bases immediately adjacent to the probe. 

FIG. 2B illustrates the specific case where the mass-labeled probe functions as a primer 
to detect a single nucleotide polymorphism. In Step A, following hybridization to a template 
nucleic acid, a polymerase is used to add a single nucleotide chain terminator or mass-modified 
version thereof, selecting from the four possible bases. Following probe extension, the mass- 
labeled product is released (Step B) and analyzed by mass spectrometry (Step C). As in FIG. 
2A, the probe optionally comprises a solid-phase binding group thai may be used to bind and 
wash the probe prior to the releasing step. In this example a T chain terminator is added 
increasing the mass of the mass-label product by 298 Da, indicating the presence of an A within 
the template at the targeted position. 



FIG. 2C illustrates a different embodiment for the use of a mass-labeled probe in the 
determination of single nucleotide polymorphisms. A mass-labeled probe is hybridized to a 
template and is extended by polymerase which incorporates a single chain-terminating nucleotide 
(Step A). The chain terminating nucleotide is modified to contain a solid-phase binding group 
such as biotin (labeled B) that is used to capture the extended mass-labeled primer prior to 
release of the mass-label product (Step D). In this particular illustration the probe is being used 
to identify whether or not an A nucleotide is present in the position adjacent to where the probe 
hybridizes. While the reaction may include all four chain terminating nucleotides, only the T 
chain terminator is modified to carry a solid-phase binding group. Therefore only if T 
incorporates, and A is present in the template, will the mass-labeled probe be modified and 
captured to the solid phase (Step B) . Use of a washing step (Step C) prior to release (Step D) 
will remove any probes that have not incorporated T, removing their mass labels from the 
system. Only probes that were bound to the solid phase (Step B) will be detected in the mass 
spectrometer (Step E). The mass label is indicated by Mx, where the x signifies that this 
component may have a single molecular mass or it may represent a combination of 2 or more 
molecules of defined mass. A multiplex of many different probes is possible. The release group, 
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Re, may be placed in the linker connecting the mass label to the probe, or at any position within 
the backbone of the probe. This methodology may be extended to cases where a combination of 
nucleotides and chain-terminating nucleotides are used, as well as oligonucleotides, where 
particular components are selected to contain a solid-phase binding group. 

FIG. 3A and FIG. 3B illustrate a generalized scheme to produce a mixture of nucleic 
acid probes each with a unique single or combination of mass labels (FIG. 3A) and, in particular, 
a generalized scheme to incorporate mass-labeled nucleotides or oligonucleotides into a 
polynucleotide sequence using DNA polymerase (Step A) or ligase (Step B) (FIG. 3B). 

FJG, 3A illustrates a nucleic acid probe containing an invariant region and a variable 
region. The invariant region, which is optional, carries the same or near the same sequence for 
all probes within a family. The variable region contains all possible sequences or some subset 
thereof. As an example, if the variable region is 4 nucleotides in length 256 different probes can 
be made, if the variable region is 6 nucleotides in length 4096 different probe can be made. 
Associated with each probe sequence is a single or combination of mass labels. In either case, 
the mass labels chosen are unique to each sequence. In cases where combinations are used the 
mass labels (labeled M) may be single labels attached to different probes carrying the same 
sequence or multiple labels attached to a single probe, or some combination thereof. 

FIG. 3B illustrates two embodiments where the mass-labeled family of probes may be 
used to screen a nucleic acid template. In addition to simple hybridization of the probe to 
template, the probes may be extended using either polymerase (Step A) or ligase (Step B). In 
either case nucleotides or oligonucleotides may be used that carry additional mass labels (labeled 
M*) identifying the sequence of the nucleic acid product being added, therefore enlarging the 
total template sequence determined per probe hybridization event. In a prefered embodiment the 
template is bound to the solid phase. Alternatively, the nucleotides or oligonucleotides added to 
the probe may contain a solid-phase binding group, enabling the isolation of the probe and 
attachment via solid-phase capture. As illustrated, X-Y represents Watson-Crick base pairing in 
the variable region of the probe, and N-M represents Watson-Crick base pairing in the added 
nucleotide or oligonucleotide sequence. 
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FIG. 4A, FIG. 4B, and FIG. 4C illustrate different combinatorial approaches to 
preparing mass labeled probes (FIG. 4A), using mass-labeled probes to screen a vector insert 
(FIG. 4B), and enzymatic methods, including transcription and PCR for the preparation of large 
mass-labeled polynucleotide probes (FIG. 4C). 

5 

FIG. 4 A describes an example of how combinatorial labels may be used to label a 
complex set of oligonucleotides. The example describes a set of probes that have a variable 
region 4 nucleotides long comprising 256 possible sequence combinations. Variable regions 
shorter or longer are also possible. In the table and example list (C), it is shown how a set of 16 

10 different mass labels may be used to create a mass label signature that is unique for all 256 
combinations. Two different approaches may be used to creating the labeled probes, the first (A) 
being the use of 16 different phosphoramidites each containing a different mass label that are 
used according to the base and position of synthesis. This approach leads to a set of molecules 
each with 4 labels on them and is performed as a single reaction. Variants are possible where the 

1 5 synthesis is split into multiple pots and standard phosphoramidite are used in some positions to 
reduce the number of labels per molecule. The second combinatorial approach (B) is to 
presynthesize the 256 combinations in 16 different reactions prior to adding the mass labels, each 
of which is used to define one of the 4 bases in.one of the 4 positions. Following oligonucleotide 
synthesis, each of the 16 different reactions is coupled to one of 16 different mass labels. The 

20 end product is that each probe in the pool contains only one specific mass label. The second 
approach offers greater flexibility for the placement and type of the mass label since it is not 
. coupled directly to the oligonucleotide synthesis. Other labeling schemes can be envisioned 
when using the post oligonucleotide sythesis method especially when the oligonucleotide set is 
synthesized in a larger number of reactions, with ultimate flexibility if the 256 combinations are 

25 all synthesized separately. With either approach the synthesis may optionally include an 
invariant synthetic region as shown in FIG. 4A. The variable region may also include one or 
more discontinuous bases within the invariant region. These probes may be applied to screening 
for polymorphisms in diagnostic and genomic applications including single nucleotide 
polymorphisms where the variable region is only one nucleotide long. 

30 

FIG. 4B describes how the combinatorially. labeled probes may be used to screen 
polymorphic sequences that are adjacent to the insert sequences within cloning vectors (A), 
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including cDNA and genomic clones. The use of an invariant sequence within the probes allows 
the probes to be anchored at the junction between the known vector sequence and the unknown 
insert sequence with the invariant region of the probe hybridizing to the known sequence and the 
variable region selecting its complement in the unknown region (B). Methods utilizing these 
probes include simple hybridization to one or both of the clone insert ends, nucleotide or 
oligonucleotide extensions as described in FIG. 3B, and use of the probes for primer extension to 
make a single copy of the insert or for purposes of amplification. For a given insert sequence, 
use of forward and reverse probes in a PCR amplification would result in the selection of only 
one forward and one reverse probe out of the set to create the amplification product. This 
technique can be combined with a number of different selective mass label release 
methodologies to identify sequences. 

FIG. 4C illustrates two different methods for creating mass-labeled polynucleotide 
probes by either transcription (A) or PCR amplification (B). Use of RNA transcription to 
synthesize mass-probes is limited to sequence regions that are downstream from a promoter 
sequence (labeled P). Typical synthetic procedures would utilize RNA polymerase and 
ribonucleoside triphosphates, including mass-labeled versions that may carry one or more mass 
labels. Shown in (A) is a transcription vector carrying a transcription promoter and a clone insert 
sequence to be transcribed downstream. The vector also carries one or more restriction sites 
(labeled R) that may optionally be cut to control the length of transcripts. Virtually any 
amplification technique may be used to create mass-labeled probes including PCR, as is shown 
in (B). PCR amplification requires the use of two opposing primers to enable exponential 
amplification of the sequence located between them. One or more mass labels may be placed on 
one or both of the primers or optionally incorporated through the use of mass-labeled nucleoside 
triphosphates. 

FIG. 5A and FIG. 5B illustrate schemes for detecting mutations using mismatch specific 
techniques with enzymatically sythesized mass-labeled probes. Generally the methodology 
requires the cross hybridization of normal and mutant or polymorphic nucleic acid to form a 
double-stranded product containing a mismatch; enzymatic or chemical cleavage at the site of a 
mismatch; and cleavage induced digestion of the probe to release one or more mass labels. In the 
example shown in FIG. 5A and continued in FIG. 5B, a double-stranded mass-labeled nucleic 
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acid probe is synthesized using PCR (A), the 3' ends of the product are blocked from exonuclease 
digestion (B), the PCR probe is hybridized to mutation carrying DNA (C) which leads to the 
formation of a base-pair mismatch, the mismatches are cleaved (D), the cleaved products are 
digested with a 3' to 5' exonuclease (E), the mass labels are released (F) and analyzed by mass 
5 spectrometry (G). Examples of 3' exonuclease blocking groups include nucleotide mimics 
incorporated near the 3' end, such as nucleotides contains boranophosphates or 
phosphorothioates, or the use of 3' overhangs created during nested-set PCR or by template 
independent extension by terminal transferase in combination with a double-strand-specific 3' to 
5' exonuclease, such as exonuclease III, that does not recognize or digest 3 1 overhangs. 
10 Examples of mismatch specific cleavage agents for use in (D) include the chemical Os0 4 , 
KMn0 4 . and HONH 2 , and enzymes, such as rnutl-ILS, T4 endonuciease VII, mutY DNA 
glycosylase, thymine mismatch DNA glycosylase, or endonuciease V. Methods using RNA or 
RNA/DNA hybrids are also possible. 

15 FIG. 6A, FIG. 6B and FIG. 6C illustrate schemes for the synthesis of peptide-linked 

nucleoside triphosphates (FIG. 6A), an oligonucleotide with a linker molecule that contains a 
release group, a disulfide, and a terminal amino-modification for coupling a peptide of some 
other mass label component to the end (FIG. 6B) S and a scheme for the synthesis of a peptide- 
linked nucleoside phosphoramidite (FIG. 6C). 

20 

FIG. 7A and FIG. 7B. show the mass spectra of the unconjugated oligonucleotide (FIG. 
7A) and the oligonucleotide-peptide conjugate (FIG. 7B) of Example ID. The spectrum of FIG. 
7A contains in addition to the signal for the desired oligonucleotide at m/z 7052, signals showing 
the presence of two significant synthesis failures that correspond to one base and three bases less, 
25 and also signals of doubly charged ions for each of these. The spectrum of FIG. 7B shows that 
the purified conjugate is of similar purity to the starting oligonucleotide. 

FIG. 8A, FIG. 8B, FIG. 8C, and FIG. 8D show the mass spectra of a hybridized, mass- 
labeled probe and target in a buffer after Exonuclease III digestion (FIG. 8A), a hybridized, 
30 mass-labeled probe and target incubated with no Exonuclease III (FIG. 8B), of a mass-labeled 
probe in buffer incubated with Exonuclease III (FIG. 8C), of a mass-labeled probe incubated 
with Exonuclease III buffer in the presence of a non-complementary. 36-mer target (FIG. 8D). 
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As shown in these FIGS., the mass label is released only in the presence of the exonuclease and a 
complementary target strand. 

FIG. 9A, FIG. 9B and FIG. 9C compare solid support grid assays using a radioactively- 
labeled probe (FIG. 9A), fluorescently-labeled probes (FIG. 9B) and mass-labeled probes (FIG. 
9C). 

FIG. 9A describes the classical approach to probing nucleic acid samples arrayed on a 
spaced grid. Commonly nucleic acid samples representing mRNA isolates, cDNA clones, 
genomic clones are arrayed on a nylon membrane or filter grid (A). Following a 
photocrosslinking process to covalently attach the samples to the rriembrane, a radioactive probe 
(B) (labeled A), in solution, is added and incubated with the grid (C). The probe hybridizes to 
positions in the grid where the nucleic acid samples contain a length of sequence complementary 
to the probe. After wash step the grid is exposed to X-ray film and the hybridization positions 
are identified (indicated by the A positions in the grid) (D). 

FIG. 9B illustrates the extension of the process in FIG. 9A, to the use of fluorescently- 
labeled probes (B). Because of the different emission spectra of different fluorescent labels it is 
possible to multiplex a small number, e.g. 4 (labeled A, B, C, D), of differently labeled 
fluorescent probes and cross hybridize them to the grid (C). In the case where fluorescence is 
used, the grid may be composed on a glass plate, rather than a filter or membrane, to enable 
fluorescence scanning techniques. 

FIG. 9C illustrates the use of mass-labeled probes (B) (labeled A-S) for hybridization 
against a gridded array of nucleic acid samples. Either single or combinatorial labeling 
techniques may be used to create a few to millions of different probes, all simultaneously 
hybridized against the array. The grid (D), which may be a nylon membrane or some other 
conductive material may be scanned directly in the mass spectrometer following hybridization, 
wash, mass-label release, and matrix addition steps (C). Scanning each position of the grid in the 
mass spectrometer reveals one of the many possible mass-label signatures associated with each 
unique probe. Typical examples of assays that would use this technology include the use of 
known gene-specific probes against gridded cDNA clones, mRNA, cDNA or amplified cDNA 
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pools. Genomic probes, both known or unknown against gridded genomic clones. mRNA, 
cDNA, amplified cDNA against known gridded genes. 

FIG. 10A and FIG. 10B compare library expression analysis using a fluorescence based 
system (FIG. 10A) and a mass-labeled system (FIG. 10B). Fluorescence labeling of pairs of 
cDNA pools derived from mRNA is used to cross compare the gene expression patterns between 
two different biological samples. 

In FIG. 10A, one cDNA pool is labeled with fluorescent tag A while the other pool is 
labeled with fluorescent tag B (A). These pools have their concentrations normalized and are 
mixed (B). The mixture of the pools is then hybridized against a gridded, reference array of 
known genes, typically arrayed as cDNA clones. Following hybridization the array is scanning 
fluorimetrically and the ratio of the two tags is measured for each location. For a given location 
if tag A is twice the intensity of tag B, it is determined that the gene, which is gridded to that 
location, is expressed as mRNA at twice the concentration for sample A than for sample B. 

FIG. 10B, expands the concept of competitively hybridizing cDNA pools beyond the 2 
pool level. The use of releaseable mass labels provide the means for the preparation of many 
more pools (A) (labeled A-H), cross-competitive hybridization (B), and detection (C) of many 
more pools of expressed message ail simultaneously. 

FIG. 11 illustrates the basic principal of release of a mass label from a nucleic acid probe 
for analysis by mass spectrometry. The mass label, Ml, is released either chemically or 
enzymatically (A) and detected by mass spectrometry (B). 

FIG. 12 illustrates selective release of mass labels following hybridization of a nucleic 
acid probe to a target DNA sequence. Mass-labeled nucleic acid probes (A), that may contain 
more than one label (as shown), and having different masses of mass label (not shown), are 
hybridized to a complementary nucleic acid target (B) to form a double-stranded complex (C). 
This complex is recognized by a double-strand-specific exonuclease and the probe is digested 
(D), releasing mass labels from the probe (E). For processive exonucleases the process will 
continue (F) until the entire probe is digested (G). The digestion is then analyzed by mass 
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spectrometry and the released mass labels are detected (H). Mass labels comprise at least one 
nucleotide when digested by an exonuclease. 

FIG. 13 illustrates the separation of peptides A-G by MALDI mass spectrometry where 
A is angiotensin I, B is substance P, C is CGYGPKKKRKVGG (SEQ ID NO:2), D is 
TCVEWLRRYLKN (SEQ ID NO:7), E is CSRARKQAASIKVSADR (SEQ ID NO:8), F is 
oxidized A-chain insulin and G is melittin. 

FIG. 14 illustrates a schematic representation of a process by which a series of gene- 
specific mass-labeled nucleic acid probes are used to detect and quantify the amount of different 
targeted mRNAs within a given sample. A starting pool of nucleic acid '(A), that is the mRNA, 
cDNA copy of the mRNA, or some amplified multiplex of nucleic acid derived from the mRNA, 
is mixed with a set of message-specific mass-labeled nucleic acid probes (B) (probes with 
different mass labels labeled A-S). The mixture is allowed to hybridize (C) wherein probes that 
find complementary messages in the pool form double-stranded complexes, wherein the 
concentrations of the gene-specific double-stranded complexes is proportional to the levels of 
mRNA present in the starting material. Following the formation of double : stranded complexes, 
the mixture is treated with a double-strand-specific nuclease, e.g. exonuclease III treatment, 
selectively releasing mass labels from probes that had hybridized (D). The released mass labels 
(labeled A-S) are then analyzed by mass spectrometry (E), wherein the quantity of each mass 
label detected is proportional to the levels of mRNA present in the starting material. The 
selective release step may optionally use double-stranded chemical release probes as well as 
solid phase capture methods to differentiate double-stranded probes from unhybridized single- 
stranded probes. 

FIG. 15A and FIG. 15B shows two mass spectra. For FIG. ISA, an rtPCR™ reaction 
was performed using a pair of mass-labeled primers targeted at the mRNA for ribosomal protein 
L7. Following the PCR™, the reaction mix was treated with the double-strand-specific 
exonuclease T7 gene 6 exonuclease. Only when a double-stranded PCR™ product is formed 
does the exonuclease digest the product and release the two mass labels, as indicated by two 
peaks in the spectrum. ^In FIG. 15B, a control was performed where a single-stranded, mass- 
labeled primer was incubated with T7 gene 6 exonuclease. No digestion occurred. 
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FIG. 16 illustrates the release of a series of seven different mass-labeled probes which 
were hybridized to seven different cDNA plasmids and then treated with exonuciease III. An 
aliquot of the double-strand-specific digestion was taken and analyzed by mass spectrometry. 
The mass spectrum is shown with the peaks corresponding to each mass label signal labeled A- 
G. 

FIG. 17A and FIG. 17B shows two mass spectra from a SNP analysis using a mass- 
labeled primer and a biotinylated dideoxynucleoside triphosphate. In FIG. 17A a 
complementary match is made between the polymorphic base on the . template and the 
biotinylated dideoxynucleoside triphosphate. The mass-labeled primer has been extended and 
therefore biotinylated, which allows it to be captured to a streptavidin-coated surface, washed 
and subsequently cleaved from' the surface. FIG. 17B shows a mass spectrum from a reaction in 
which the base at the polymorphic site is not a complementary match to the biotinylated 
dideoxynucleoside triphosphate present in the reaction. No extension of the primer occurred as 
evidenced by the absence of a mass spectrometric signal for the primer mass label. The 
unextended primer is not captured on the streptavidin-coated surface and is removed in the 
subsequent washes. 

FIG. 18 shows a mass spectrum from a multiplex SNP analysis in which three differently 
mass-labeled primers for three different polymorphic sites are all simultaneously extended with a 
biotinylated dideoxynucleoside triphosphate. The three extended primers are all capable of being 
captured on a streptavidin-coated surface, washed to remove unextended primers and then 
cleaved from the surface. 

FIG. 19A and FIG. 19B shows two mass spectra from a SNP analysis in which the 
extension is carried out a few bases past the . polymorphic site and for which biotin is 
incorporated through a biotinylated deoxynucleoside triphosphate. The mixture of triphosphates 
in the reactions consists of deoxy-ATP, biotinylated-deoxy-CTP, and dideoxy-TTP. 

In FIG. 19 A the spectrum is from .a reaction in which the polymorphic site on the 
template, located one base past the 3' -end of the primer, is a T. Since the polymorphic site is a 



WO 98/26095 ^ PCT/US97/22639 

complementary match to one of the deoxynucieoside triphosphates in the reaction, the primer is 
extended past the polymorphic site, and subsequently incorporates a biotinylated-dCTP before 
terminating chain extension with the dideoxynucleoside triphosphate. 

The reaction whose spectrum is shown in FIG. 19B is one in which the polymorphic site 
on the template is A. Therefore a dideoxy-TTP is incorporated at the first base past the primer, 
and chain extension is terminated prior to incorporation of the biotinylated-dCTP, which results 
in a lack of signal in the mass spectrum. 

FIG. 20A and FIG. 20B show two mass spectra from primer extension analyses in 
which a .mixture of three primers, differing only in -their 3 '-end-bases and each containing unique 
mass labels, is extended with biotinylated dideoxynucleoside triphosphate. In FIG. 20A the 
mass spectrum shows signal predominantly for the primer whose 3*-end base (primer A) is a 
perfect match for the template used in the reaction. The spectrum in FIG. 20B is from a reaction 
in which the template is changed from the reaction in FIG. 20A in such a way that the 3 '-end 
base matches to a different primer and gives predominantly signal from extension of primer E. 

FIG. 21 A and FIG. 21B show two mass spectra comparing the chemical cleavage rates 
for double-stranded versus single-stranded DNA. A cleavable oligonucleotide containing a 5'-S- 
P bond is cleavable by AgN03. Two cleavage reactions are run. In the first reaction the 
cleavable oligonucleotide is hybridized to a complementary oligonucleotide to make it double- 
stranded prior to adding cleavage reagent. The second reaction is performed on single-stranded 
oligonucleotide. The mass spectrum in FIG. 21 A shows the products from cleavage of double- 
stranded DNA. The cleavage products are expected at masses of 6560 Da and 1470 Da, while 
the uncleaved oligonucleotide is seen at 8012 Da. The spectrum of FIG. 21A indicates that only 
about 5% cleavage has occurred. The spectrum in FIG. 21B, which is from cleavage of single- 
stranded oligonucleotide demonstrates that under the same conditions, cleavage is about 90% 
complete. 

FIG. 22 A and FIG. 22B show two mass spectra from a probe assay of a gene-specific 
RNA transcript. Two exonuclease III digestions reactions are run. In both reactions a mixture of 
two probes is present and the template consists of either RNA transcript or the DNA PCR- 
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product template from which the RNA is transcribed. Only one of the probes is complementary 
to the RNA transcript the other probe is complementary to the opposite strand. Therefore if mass 
label signal is obtained from the DNA PCR product, signals for both probes are seen, while if the 
signal is obtained from RNA transcript, only one signal is seen. 

In FIG. 22A the mass spectrum shows the resulting released mass label for the reaction 
in which RNA transcript is present. Since only one signal is seen, the signal must come from 
digestion of the probe hybridized to the RNA transcript. The second reaction contains a 100-fold 
greater amount of DNA PCR product than is present in the first reaction, and no RNA transcript. 

FIG. 22B shows the mass spectrum resulting from the second reaction. The presence of 
signals from both probes confirms the fact that the signal in FIG. 22A comes from RNA- 
hybridized probe. 



FIG. 23A, FIG. 23B, FIG. 23C, and FIG. 23D show a set of four mass spectra which 
compare the analyte selectivity of two different matrices for MALDL The samples used for the 
comparison are equimolar mixtures of a nucleotidylated peptide and an oligonucleotide obtained 
by a selective chemical cleavage of an oligonucleotide-peptide conjugate. FIGS. 23 A and 23B 
compare spectra of the same sample obtained with 2,5-dihydroxybenzoic acid matrix (FIG. 23A) 
and with 3-HPA matrix (FIG. 23B). The peptide signal predominates in FIG. 23A while the 
oligonucleotide predominates in spectrum FIG. 23B due to differing desorption selectivities or 
efficiencies of the matrices for the peptide and the oligopeptide. The spectra in FIG. 23C and 
23D make the same comparison with a different sample showing that the ionization selectivity is 
general. 

FIG. 24 illustrates the use of a double-stranded, mass-labeled nucleic acid probe for 
detecting and quantifying the presence of a nucleic acid target sequence. Contained within the 
double-stranded probe is a chemical cleavage group that, under proper conditions, only cleaves 
when the nucleic acid probe is single-stranded. Examples of chemical cleavage groups that 
demonstrate enhanced cleavage rates when single stranded include chemically labile nucleic acid 
backbone modifications such as 5'-(S)-phosphorothioate, 3'-(S)-phosphorothioate, 5'-(N)- 
phosphoramidate, 3 '-(N)-phosphoramidate, and ribose. Probing of a nucleic acid target sequence 
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involves combining the double-stranded probe (A) with the single-stranded target (B) and 
allowing them to denature and anneal under equilibrium conditions (C). The probe strand 
containing the mass label and single-strand-specific release group (labeled Re) is homologous to 
the target nucleic acid; the complementary strand is also complementary to the target. The other 
products of this equilibrium event are the mass-labeled, cleavable strand in single-stranded form 
(D), and the complementary strand annealed to the target (E). The amount of complementary 
strand released from the mass-labeled strand and annealed to the target is proportional to the 
concentration of the target nucleic acid. Following the annealing process the probes are treated 
with a single-strand-specific chemical cleaving agent (F) yielding cleaved single-stranded probe 
(G) and detected and quantitated by mass spectrometry (H). As with other mass-labeled probes 
described here, the mass label may be wholely or only partially contained within the nucleic acid 
probe or reactive group and may include the use of nucleic acid mimics. 

FIG. 25 illustrates the use of mass-labeled substrates in enzyme-linked affinity assays. 
Specifically illustrated are the cases where the target molecule (labeled T) is a protein (A) and a 
nucleic acid (B). In illustration (A), an antibody (labeled Ab) is used to recognize the solid- 
phase bound target. The antibody is conjugated to the enzyme (labeled E) used to produce 
signal. In this particular affinity assay, the enzyme recognizes a mass-label substrate (labeled 
MX) and converts it to product which in this example is a cleavage event to form two products 
(labeled M and X) which are then analyzed by mass spectrometry. Regarding the mass label 
substrates, the primary requirement is that the enzyme modify the mass of the substrate when it is 
converted to product by either adding or removing chemical moieties from the substrate. In 
illustration (B), the antibody has been replaced by a nucleic acid probe that is then conjugated to 
the signal producing enzyme. The assay is extremely generalizable and one skilled in the art 
would be able to identify a variety of combinations of probe and target, as well as enzymes and 
mass-label substrates that may be used. 

FIG. 26 illustrates two examples of mass-label substrates for use in enzyme-linked 
affinity assays. Specifically illustrated are two examples, (A) a double-stranded oligonucleotide 
containing a restriction endonuclease site (labeled R), and (B) a polypeptide containing a specific 
proteolytic linkage. In both examples it is possible to develop a repertoire of enzymes and mass- 
label substrates, since a variety of restriction endonucleases and proteases exist that exhibit either 



WO 98/26095 ^ PCT/US97/22639 

sequence-specific or monomer-specific cleavage activity. Use of these classes of enzymes allow 
a plurality of affinity assays to take place simultaneous within the same reaction vial. All 
producing mass-differentiable mass-label products. As with other mass-labeled probes described 
here, the mass label may be wholely or only partially contained within the nucleic acid or 
polypeptide substrate and may include the use of nucleic acid mimics or non-natural amino acids. 



DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 



The present invention is directed to the composition and use of releasable, nonvolatile 
mass labels for chemical analysis. The mass labels will be detectable by mass spectrometry. 
The present invention also describes novel methods utilizing mass labels of any form. The term 
nonvolatile as used herein refers to a molecule which when present in its pure, neat form and 
heated, does not sublimate intact to any significant extent. Also included in the definition of 
nonvolatile compounds are compounds which when present in their pure, neat form cannot be 
practically analyzed by mass spectrometry when conventional gas chromatography is employed 
in the sampling process. An advantage of using nonvolatile mass labels versus volatile mass 
labels, is that the sample mixtures are thereby easily physically stable after release. The mass 
labels described may be attached to a probe molecule that can specifically interact with the 
intended target. In some cases, a special release group may be included to chemically link the 
mass label to the probe. 

It is also possible to use mass labels which have negligible vapor pressure at room 
temperature but can be considered volatile by the above definition. In the present work, the 
novel mass labels released from the probe molecule evaporate insignificantly if at all at room 
temperature and are not efficient electrophores. Molecules belonging to this category are termed 
involatile mass labels. 

The compounds of the present invention are useful for detecting a wide variety of 
biomolecular interactions. Representative examples include identification of gene sequences, 
identification of non-coding nucleotide sequences, identification of mutations within a gene or 
protein sequence, detection of metals, detection of toxins, detection of receptors on an organism 
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or a cell, characterization of antibody-antigen interactions, enzyme -substrate interactions and 
characterization of ligand interactions. 



A. Mass labels 

Mass label is a term that can be used synonomously with tag or signal. Examples of the 
types of mass labels for the present invention include a repertoire of compounds, preferably ones 
that share similar mass spectrometric desorption properties and have similar or identical coupling 
chemistries in order to streamline synthesis of multiple mass label variants. A mass label of the 
present invention is detectable by mass spectrometry. Representative types of mass 
spectrometric techniques include matrix-assisted laser desorption ionization, direct laser- 
desorption, electrospray ionization, secondary neutral, and secondary ion mass spectrometry, 
with laser-desorption ionization being preferred. The dynamic range of mass spectral 
measurements can generally be extended by use of a logarithmic amplifier and/or variable 
attenuation in the processing and analysis of the signal. An example of a peptide mixture 
separated by mass spectrometry is shown in FIG. 13. 

Mass labels may include a vast array of different types of compounds including 
biopolymers and synthetic polymers. Representative biological monomer units that may be used 
as mass labels, either singly or in polymeric form, include amino acids, non-natural amino acids, 
nucleic acids, saccharides, carbohydrates, peptide mimics and nucleic acid mimics. Preferred 
amino acids include those with simple aliphatic side chains (e.g., glycine, alanine, valine, leucine 
and isoleucine), amino acids with aromatic side chains (e.g., phenylalanine, tryptophan, tyrosine, 
and histidine), amino acids with oxygen and sulfur containing side chains (e.g., serine, threonine, 
methionine and cysteine), amino acids with side chains containing carboxylic or amide groups 
(e.g., aspartic acid, glutamic acid, asparagine and glutamine), and amino acids with side chains 
containing strongly basic groups (e.g., lysine and arginine), and proline. Derivatives of the 
above described amino acids are also contemplated as monomer units. An amino acid derivative 
as used herein is any compound that contains within its structure the basic amino acid core of an 
a amino-substituted carboxylic acid, with representative examples including but not limited to 
azaserine, - fluoroalanine, GAB A, ornithine, norleucine and cycloserine. Peptides derived from 
the above described amino acids can also be used as monomer units. Representative examples 
include both naturally occurring and synthetic peptides with molecular weight above about 500 
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Daltons, with peptides from about 500-5000 Daltons being preferred. Representative examples 
of saccharides include ribose, arabinose, xylose, glucose, galactose and other sugar derivatives 
composed of chains from 2-7 carbons. Representative polysaccharides include combinations of 
the saccharide units listed above linked via a glycosidic bond. The sequence of the polymeric 
units within any one mass label is not critical; the total mass is the key feature of the label. 

The monomer units according to the present invention also may be composed of 
nucleobase compounds. As used herein, the term nucleobase refers to any moiety that includes 
within its structure a purine, a pyrimidine, a nucleic acid, nucleoside, nucleotide or derivative of 
any of these, such as a protected nucleobase, purine analog, pyrimidine analog, folinic acid 
analog, methyl phosphonate derivatives, phosphotriester derivatives, borano phosphate 
derivatives or phosphorothioate derivatives. 

Mass labels according to the present invention may also include any organic or inorganic 
polymer that has a defined mass value, remains water soluble during bioassays and is detectable 
by mass spectrometry. Representative synthetic monomer units that may be used as mass units 
in polymeric form include polyethylene glycols, polyvinyl phenols, polymethyl methacrylates, 
polypropylene glycol, polypyroles, and derivatives thereof. A wide variety of polymers would 
be readily available to one of skill in the art based on references such as Allcock et al (1981) 
which describes the properties of many additional polymers contemplated for use in the present 
invention. The polymers may be composed of a single type of monomer unit or combinations of 
monomer units to create a mixed polymer. The sequence of the polymeric units within any one 
mass label is not critical; the total mass is the key feature of the label. 

For nonvolatile mass labels having mass below about 500 Da, usually significant ionic 
character is required; representative examples include polyethylene glycol oligomers of 
quaternary ammonium salts (e.g., R-(0-CH 2 -CH2) n -N(CH 3 )3 + • CI") and polyethylene glycol 

oligomers of carboxylic acids and salts (e.g., R-(0-CH 2 -CH 2 )n-C02" ■ Na + ). 



Examples of involatile mass labels typically include small oligomers of polyethylene 
glycol and small peptides (natural or modified) less than about 500 Da in molecular weight. In 
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these instances, as for all of the cases considered herein, mass analysis is not by electron 
attachment. 

Mass labels of the present invention may also include a variety of nonvolatile and 
involatile organic compounds which are nonpolymeric. Representative examples of nonvolatile 
organic compounds include heme groups, dyes, organometallic compounds, steroids, fullerenes, 
retinoids, carotenoids and polyaromatic hydrocarbons. 

In addition to the polymer or mixed polymer mass labels described, mass-labels of the 
present invention also include mixed mass labels containing a mass-variable polymeric 
component and a nonpolymeric mass static component. A representative example includes a set 
of mass labels with a polymeric component where the number of repeat units within the set is a 
range from .about 1 0 to 1 00, and on each polymer is a compound with a fixed large mass. In a 
preferred embodiment, the mass labels within a set all contain the same mass static component. 
In this preferred set of compounds only the length of the polymer is changed to provide a set of 
mass labels with incremental increases in mass and a relatively uniform signal between mass 
labels. These compounds provide a means for using mass labels with desirable spectral 
properties but are not available in a large repertoire of different masses. 

It is preferable when using multiple mass labels on a probe, to avoid signal overlap. In 
addition to presenting a large, primary signal for a mass label with a single charge, there is also 
the potential for multiply charged versions of a mass label to present a signal as well as 
dimerized versions of a mass label. The presence of multiple signals for a single mass label can 
potentially overlap with and obscure the signal for the primary peak of a second mass label. 
Thus typically the range of mass labels used for a given analysis may have a mass range where 
no multiply charged or dimer species can interfere with the detection of all mass labels, for 
example, the mass labels may have a range of masses wherein the smallest mass-label is more 
than half the mass of the largest mass label. 

B. - Reactive Groups 

The mass label is typically attached to a reactive group. The reactive groups of the 
present invention may be any biomolecule capable of specific molecular recognition. In 



WO 98/26095 PCIYUS97/22639 

particular, the reactive group may form a specific interaction with the target molecule. This 
interaction may be noncovalent, for example, hybridization of an oligonucleotide to a DNA 
target, or covalent such as crosslinking. Representative reactive groups of the present invention 
include polypeptides, antibodies, enzymes, polynucleic acids, lipids, steroids, carbohydrates, 
antibiotics and compounds such as neocarzinostatin which have a preference for certain DNA 
sequences, with polynucleic acids preferred and oligonucleotides being more preferred. 
Representative steroid hormones include estrogens, progestins and androgens. 

Representative reactive group-target molecule interactions include oligonucleotide- 
oligonucleotide hybridization, polynucleotide-polynucleotide interactions, enzyme-substrate or 
substrate analog/intermediate interactions, poiypeptide-nueleic acid interactions, protein-iigand 
interactions, receptor-Iigand interactions, lipid-lipid interactions, carbohydrate-carbohydrate 
interactions, polypeptide-metal interactions, nucleic acid-metal interactions or antigen-antibody 
interactions. 

In certain embodiments the probe may be a synthetic oligonucleotide or enzymatically 
synthesized oligonucleotide that may be a DNA molecule, an RNA molecule, or some variant of 
those molecules, such as a peptide nucleic acid. The oligonucleotide will typically be able to 
selectively bind a substantially complementary sequence. As used herein a substantially 
complementary sequence is one in which the nucleotides generally base pair with the 
complementary nucleotide and in which there are very few base pair mismatches. The 
polynucleotide may be relatively small, such as a 1 0-mer, or larger, such as a kilobase insert in a 
plasmid or a kilobase amplified nucleic acid ("amplicon") or a long RNA transcript. The 
polynucleotide can be bigger, smaller or the same size as the target. The probe is distinguished 
from the target by the fact that the probe contains a mass label. 

Representative examples of a covalent interaction between a reactive group and a target 
include proteins as reactive groups activated with crosslinkers to form conjugates with the target 
molecule, such as antibody-antigen interactions, enzyme-substrate interactions, receptor-Iigand 
interactions, receptor-membrane interactions or a protein-nucleic acid interaction. 
Representative crosslinking reagents include chemically activated crosslinkers such as EDC or 
MBS and photoreactive crosslinkers such as SADP or PNP-DTP. 



WO 98/26095 



37 



PCT7US97/22639 



C. Methods for Releasing the Mass label 

In some embodiments, it may be important to release the mass label from all or most of 
the reactive group prior to spectrometric analysis, as represented in FIG. II for a mass-labeled 
nucleic acid probe. For this reason, a release group is desirable. A number of means may 
effectuate the release, including a labile chemical linkage between the mass label and the reactive 
group. A labile chemical linkage as used herein is any moiety which upon treatment with a 
second chemical agent, light, enzyme or heat will cleave the moiety and release the mass label. 
These linkages may include chemically cleavable groups incorporated within the phosphate 
backbone linkage .(e.g. .replacement -of phosphate -with a -phosphoramidate) or as a subsiiiuent on 
or replacement of one of the bases or sugars of the oligonucleotide primer (e.g., a modified base 
or sugar, such as a more labile glycosidic linkage). Such chemically cleavable groups would be 
apparent to one of skill in the art in light of the present disclosure and include, for example, 
dialkoxysilane, 3*-(S)-phosphorothioate, 5'-(S)-phosphorothioate, 3*-(N)-phosphoroamidate, 5- 
(N)-phosphoroamidate, and ribose. It has also been found experimentally that such groups 
cleave much more rapidly when the probe is in single-stranded form than when hybridized to a 
complementary strand. An example of this kinetic selectivity is presented in Example 9. The 
chemically cleavable site should generally be stable under the amplification, hybridization and 
washing conditions to be employed. Other examples of labile chemical, linkers consist of groups 
cleavable by oxidation such as dialkyl tartrate, base cleavable groups such as bis[2(alkoxy- 
carbonyloxy)ethyl]sulfone, silyl ethers and ketals which will cleave upon treatment with fluoride 
ion or acid, ortho-nitrobenzyl ethers which will cleave upon irradiation with light, and groups 
cleavable by reduction such as dialkyl disulfides. 

A preferred labile chemical linkage includes a disulfide bond which upon treatment with 
a sulfhydryl reagent, such as 2-mercaptoethanol, reduces the disulfide bond into two -SH groups. 
For mass labels that are chemically cleaved from probes, it may be preferable to remove or wash 
away any unincorporated reactive group monomers so that they are not visualized in the mass 
spectrometer. 
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In other embodiments of the invention, however, no additional linkage group will be 
needed, as the release group may be contained within the reactive group. Released mass labels 
therefore, may contain none, a portion, or the whole of the reactive group still attached to the 
specific mass label. Representative examples of release groups contained within a reactive group 
include the endogenous peptide linkages between amino acids in a polypeptide and the 
endogenous phosphodiester bond linkages between bases in a polynucleotide. When the reactive 
group is a polynucleotide, the mass label may be released during enzymatic (nuclease) digestion 
of the probe nucleotide backbone, or an acid-induced digestion of the probe nucleotide backbone. 
These endogenous linkages may also be modified to target a specific sequence within the 
reactive group. Examples include modified phosphodiester bonds such as phosphorothioates, 
phosphoramidates and dialkylsily! ketais. Nucleotide sequences may also be introduced for 
recognition by an endonuclease (restriction enzyme) such as Type II or Type IIS restriction 
endonucleases. In certain embodiments a phosphodiester bond will be the release group as 
recognized by an exonuclease enzyme. Temperature labile release is also contemplated. 
Representative examples include thermal melting of a hybridized oligonucleotide from a DNA 
target or temperature dependent denaturation of a protein to release a bound molecule. 

Specific peptide linkages may also be introduced within a polypeptide reactive group. 
Examples include peptide linkages which are specifically cleaved by chemicals such as a 
methionine recognized by CNBr, or tryptophan which can be cleaved by either iodosobenzoic 
acid or BNPS-skatole. Peptide linkages may also be introduced for recognition by an enzyme 
such as trypsin. 

A further example of endogenous bonds as release groups include chemical or enzymatic 
cleavage at a glycosidic bond. One skilled in the art would recognize that a wide variety of 
release approaches would be within the scope of the present invention. 

D. Selective Release of Mass labels 



In some of the embodiments described herein, involving the use of one or more different 
nucleic acid probes, use of mass-labeled nucleic acid probes may depend on the selective release 
of certain mass-labels correlating to the occurrence of a particular event. For instance, release of 



WO 98/26095 ^ PCT/US97/22639 

a mass-label may indicate that a hybridization event has occurred between a particular mass- 
labeled nucleic acid probe and a nucleic acid target sequence; An approach to selective release 
can involve targeted nuclease digestion of only hybridized probes existing in a double-stranded 
form as shown in FIG. 12. A number of nucleases, for example restriction endonucleases and 
DNase 1 , only digest double-stranded nucleic acids. Consequently treatment with such enzymes 
will only release mass-labels from nucleic acid probes that have successfully hybridized to a 
target sequence. As an alternative, a nuclease that only recognizes a nucleic acid sequence 
present in single-stranded form, including SI nuclease, could be used to yield signal and identity 
data for probes that do not undergo hybridization. 

The use of a hybridization probe of at least about 10-14 nucleotides in length allows the 
formation of a duplex molecule that is both stable and selective. Molecules having contiguous 
complementary sequences over stretches greater than 10 bases in length may be employed to 
increase the stability and selectivity of the hybrid. One may generally prefer to design nucleic 
acid molecules having complementary stretches of about 1 5 to about 20 contiguous nucleotides, 
or even longer where desired. For example, one may prefer to design nucleic acid molecules of 
about 25, about 30, about 35, about 40, about 45, or about 50 contiguous nucleotides and so on. 
In this context, the term "about" indicates that the nucleic acid molecule may vary from the 
stated length by from 1 to 4 nucleotides. For example, "about 25" may be understood to include 
21, 22, 23 and 24; "about 30" may be understood to include 26, 27, 28 and 29; "about 35 may be 
understood to include 31,32, 33 and 34; and so on. 

Hybridization probes may be selected from any portion of a target sequence. The choice 
of probe and primer sequences may be governed by various factors, such as, by way . of 
exemplification and not limitation, one may employ primers from regions near the termini of the 
total sequence, or from the ends of the functional . domain-encoding sequences or one may 
employ probes corresponding to the entire DNA. Probes may be designed to identify 
homologous genes between species including human or one may employ wild-type and mutant 
probes or primers with sequences designed to identify human or other non-human subjects that 
carry a certain mutation and thus may be susceptible to disease or a pharmaceutical agent. 
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Variable parameters for hybridization include temperature, time, salt 
concentration and formamide concentration. Hybridization is understood to mean the formation 
of stable, anti-parallel duplex molecules based on the specific hydrogen bonding of 
complementary nucleotide bases of the nucleic acid molecules. 

The tendency for two complementary strands of nucleic acid in solution to anneal 
or hybridize by forming hydrogen bonds between their complementary bases, is critically 
dependent on the concentration of monovalent or divalent cations in the solution. Sodium (Na + ), 
has been the cation of choice for determining the effects of salt concentration on the stability of 
duplex nucleic acids. Above a threshold Na + concentration, two complementary single strands 
(either DNA or RNA) of nucleic acid will hydrogen bend through interaction of tlic bases in each 
strand, to form a double-stranded molecule of DNA, RNA, or even a DNA-RNA heteroduplex. 
Complementary bases are adenosine (A) and thymidine (T) (in DNA), or adenosine and uridine 
(U) (in RNA), and cytosine (C) and guanine (G) in both DNA and RNA. Two hydrogen bonds 
are formed between paired A and T or A and U residues, while C-G base pairing results in the 
formation of three hydrogen bonds. The G-C base pair is therefore a stronger interaction than the 
A-U or A-T base pair. In general, hydrogen bonding (leading to duplex formation) does not 
occur between non-complementary bases. The ability of two single strands to form a stable 
double-stranded duplex depends on the sequence of bases in each strand being complementary to 
the other, such that when the strands are aligned in an antiparallel orientation, sequential 
juxtaposed bases are able to form hydrogen bonds. Although hydrogen bonding between any 
two complementary bases provides only a weak binding energy, the cumulative binding energy 
between many sequential paired bases provides sufficient attractive forces to hold the strands 
together in a stable duplex. Cations enhance the tendency for complementary strands to form 
hydrogen bonds, by masking the negative charges of the phosphate groups in the phosphodiester 
linkages which form the "backbone" of the nucleic acid strands. At low concentrations of 
positively charged ions, repulsive forces between negatively charged strands favor their single- 
stranded or denatured conformation; as cation concentration is raised, the negative charges are 
masked, complementary bases pair through hydrogen bonding, and a duplex nucleic acid 
molecule is formed. In a duplex containing a mismatched (non-complementary) base pair, the 
single unpaired position in the two otherwise complementary strands provides the target for the 
single-strand specific RNase in the RNase protection assay. 
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Other parameters besides cation concentration affect the tendency of 
complementary strands to exist in the alternative double-stranded or single -stranded 
conformations. Temperature is a critical variable; as the temperature of a solution of duplex 
nucleic acid molecules is raised, hydrogen bonds are broken first in A-U rich regions and finally 
in G-C rich regions, until above a critical temperature, the complementary strands come apart. 
The composition of the two strands, i.e., their % GC content, determines the critical temperature 
for duplex denaturation at a given ionic strength. As a corollary, the % GC also determines the 
threshold concentration of Na + needed to maintain duplex stability at a given temperature. 
Stability of duplex nucleic acid molecules in solution is also affected by the nature of the solvent. 
For example, duplexes are much less stable in formamide (which destabilizes hydrogen bonds) 
than in aqueous solution, a fact exploited by molecular biologists to achieve nucleic acid 
hybridization at lower temperatures than would otherwise be required. 

Equations have been derived to relate duplex formation to the major variables of 
temperature, salt concentration, nucleic acid strand length and composition, and formamide 
concentration. 

Eg: 

1. Tm = 81.5 - 16.6(log[Na + ]) + 0.41(%GC) - 600/N 

(Tm = temperature for duplex to half denature; N = chain length 

2. Tm = 81.5 - 16.6(log[Na + ] + 0.41(%GC) - 0.63(% formamide) - 600/N 

One can thus predict whether complementary strands will exist in double-stranded or 
single-stranded form under a given set of conditions. If conditions are chosen such that 
complementary strands form a stable duplex, the duplex will in theory be resistant to the 
nucleolytic action of enzymes (DNases and RNases) which are specific for cleavage of 
phosphodiester bonds in single-stranded molecules. Many different types of nucleases exist, 
which vary widely in their substrate specificities. The RNases commonly used in RNase 
protection assays are specific for cleavage after particular bases in single-stranded RNA 
molecules. Below the threshold Na + concentration needed to maintain duplex stability, the 
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complementary RNA strands denature into single strands, which are then substrates for 
degradation by the RNases. Susceptibility to digestion by RNase A is therefore a functional 
assay for whether complementary strands exist as single-stranded or double-stranded molecules. 

Hybridization 

Standard annealing or hybridization procedures are described by Sambrook et al (1989). 
Generally they entail two or more nucleic acids, for example probe and test sample nucleic acids, 
to be mixed together, denatured and then subjected to conditions in which complementary 
strands anneal, or base pair by hydrogen bonding to form double strands. The annealed strands 
are said to be hybridized. For example, the mixture may be heated to from about 90°C io about 
95°C for about three minutes and then gradually cooled to a lower temperature, 42°C for 
example, for a period of time sufficient to allow hydrogen bonding of the complementary 
strands. The time required for annealing of complementary strands depends on the concentration 
of each strand and will vary from a few minutes (for reactions where both probe an test nucleic 
acids are present at high concentrations), to several hours or overnight for reactions having at 
least one species present at low concentration. It is therefore advantageous to use high 
concentrations of probe and test sample nucleic acids, such as may be generated by PCR 
amplification and/or transcription of PCR amplified sequences. 

Depending on the application envisioned, one may employ varying conditions of 
hybridization to achieve varying degrees of selectivity of the probe towards the target sequence. 
For applications requiring high selectivity, one may typically employ relatively stringent 
conditions to form the hybrids, e.g., relatively low salt and/or high temperature conditions, such 
as provided by 0.02M-0.15M NaCl at temperatures of 50°C to 70°C. Such selective conditions 
tolerate little, if any, mismatch between the probe and the template or target strand. 

Of course, for some applications, for example, where one desires to identify mutants 
employing a mutant primer strand hybridized to an underlying template or where one seeks to 
isolate protein-encoding sequences from related species, functional equivalents, or the like, less 
stringent hybridization conditions may typically be employed to form the heteroduplex. In these 
circumstances, one may employ milder hybridization conditions, such as 0.15M-0.9M salt, at 
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temperatures ranging from 20°C to 55°C. Cross-hybridizing species can thereby be readily 
identified as positively hybridizing signals with respect to control hybridizations. Additionally, 
conditions may be rendered more stringent by the addition of increasing amounts of formamide, 
which serves to destabilize the hybrid duplex in the same manner as increased temperature. 
Thus, hybridization conditions may be readily manipulated to achieve the desired results. 

Release Methods 

The use of nucleases that selectively digest mass-labeled nucleic acid probes hybridized 
to a target nucleic acid allows for linear amplification of signal: For example, one may employ a 
.nuclease capable of digesting only the nucleic acid probe and not the target, e.g., a double-strand 
specific exonuclease to digest a short, linear probe in the presence of a circular target having no 
end to enable the initiation of exonuclease digestion. Long linear targets may also be used in 
cases where the exonuclease requires a recessed or blunt double-stranded end. As a probe 
hybridizes to the target, it is digested, and the digested fragments release from the target and 
make room for a second copy of the probe to hybridize. The second probe is then digested, and, 
once again, the target is free for the next hybridization. The repeated cycles of hybridization and 
digestion leads to a linear amplification of the amount of released mass label in solution, 
consequently increasing the mass spec trometric signal. It is possible to achieve a many hundred- 
fold amplification of signal using such a system. See Okano and Kambara, 1995 (exonuclease 
III); Copley and Boot, 1992 (lambda exonuclease). 

Nonselective release events may also be employed with the methods disclosed herein. 
For example, nonselective cleavage of a disulfide releasing group using a chemical agent such as 
a phosphine or a mercaptan may be used. 

In certain embodiments, detection of the desired label may depend on specific 
partitioning of the population of reactive groups or targets. Reactive groups that recognize and 
bind to a particular target may, for example, be immobilized to a specific location. For instance, 
a target sequence or sequences of nucleic acids may be attached to gridded positions on a solid 
support such as a filter, glass, gold or to a bead or a group of beads. Mass-labeLed 
oligonucleotides (probes) that do not hybridize to the target sequence may then be separated from 
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probes hybridized to immobilized targets simply by washing the filter or beads. Such approaches 
may be especially preferred for removal of unhybridized probes where a subsequent nonspecific 
release mechanism is to be employed. The reverse case may also be employed, in which the 
labeled probes are immobilized, and the targets are hybridized to them. 

Methods described herein may involve the use of a nucleic acid amplification event, such 
as polymerase chain reaction (referred to as PCR™), to link a mass-labeled nucleic acid probe, 
used specifically as a primer, to a second primer that is capable of or presently is bound to a solid 
support. An example of a second primer is one that contains a.biotin moiety. Similarly to the 
embodiment described above, binding of the amplification product to the solid phase affords a 
mechanism to wash away unused primers and then to ■nonseleetively release the remaining mass 
labels. 

A nucleic acid amplification event, involving the use of one or more different nucleic 
acid probes, may also be used to convert mass-labeled nucleic acid probes, used specifically as a 
primers, from single-stranded form to double-stranded form. This conversion allows the use of a 
double-strand-specific nuclease to selectively release only those mass labels that were attached to 
primers involved in amplification events. Unused primers remain single stranded and will not 
release their attached mass labels. 

Other methods described herein as part of the present invention, involving the use of one 
or more different nucleic acid probes, may involve the modification of a select population of 
probes following their hybridization to a target which would allow for the partitioning of the 
probe population. Such methods include double-strand dependent addition of biotinylated 
nucleotides or oligonucleotides to the end of mass-labeled probes using polymerase or ligase, 
followed by direct capture of the biotinylated probes to a streptavidin modified surface. 

As another option, analysis of mass-labeled nucleic acid probes by MALDI mass 
spectrometry may be performed using a matrix that selectively desorbs and efficiently ionizes 
intact released mass labels but not mass labels still coupled to their respective nucleic acid 
probes. Nucleic acid molecules often do not desorb well in many matrices which are yet 
effective for the desorption of released mass labels, and this difference can be accentuated by the 
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presence of impurities such as salts. Mass-labeled nucleic acid probes may typically be analyzed 
by direct laser-desorption mass spectrometry without further purification if, for example, the 
released mass label(s) are detected much more efficiently than unreleased labels. The same holds 
true for other forms of mass spectrometry. Thus, in a preferred embodiment using laser- 
desorption mass spectrometry, physical partitioning of the released and unreleased mass labels 
may not be required. One skilled in the art in light of the present disclosure can envision the use 
of a variety of other techniques for selectively partitioning probes involving probe-label 
synthesis, label release, and label mass spectral detection, in various combinations. 

E. Synthetic Techniques 

Mass labels may be' added to the reactive group during synthesis, or the reactive group 
may be modified after synthesis. For example, the modification of nucleic acid or amino acid 
building blocks provides a convenient route for developing generalized methods of mass-labeling 
reactive groups during synthesis. For example, as the polypeptide or polynucleic acid is being 
synthesized, different mass-labeled nucleotides or amino acids may be added to the mixture and 
incorporated into the growing polymer. A generalized example of a mass-labeled nucleoside 
triphosphate is depicted in FIG. 1A. One skilled in the art would in light of the present 
disclosure envision a variety of attachment schemes and positions of attachment. Generally, the 
attachment of a mass label should not substantially inhibit the interaction between the reactive 
group and target molecule, such as the hydrogen-bonding of the mass-labeled base and the 
complementary target base, or disrupt the proper folding of a. polypeptide to form an active 
protein. Furthermore, in the case of a mass-labeled nucleoside triphosphate, the label should 
typically not inhibit polymerization by a polymerase enzyme. 

One synthesis approach of the present invention, involves the use of mass label modified 
nucleoside triphosphates that are incorporated by a polymerase to produce a mass-labeled 
polynucleotide. Using this method, it is easy to load a nucleic acid probe with many copies of a 
mass label. Polymerase -based methods allow for the inexpensive synthesis of very long probes 
hundreds to tens of thousands of bases in length by incorporation into an RNA transcript or 
PCR™ amplicon. 
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Where the reactive group is a protein, the mass label may be a length of amino acids 
forming a peptide attached to either the carboxyl or amino terminus of the protein. The 
composition of the mass label may be coded directly into the DNA sequence immediately 
adjacent to the coding region of the protein that represents the reactive group. Subsequent 
transcription and translation of this DNA sequence yields a product whereby the peptide mass 
label is fused to the protein. 

F. Enzymatic Amplification Techniques 

Nucleic acid amplification methods may be used to prepare mass-labeled probes or to 
detect the presence of a target sequence. One of the best known amplification methods is the 
PCR™ which is described in detail in U.S. Patent 4 S 683 S .195 S U.S. Patent 4,683,202, and U.S. 
Patent 4,800,159, each incorporated herein by reference, and in Innis et al (1990, incorporated 
herein by reference). 

In PCR™, two primer sequences are typically prepared which are complementary to 
regions on opposite complementary strands of the target sequence. The primers may hybridize to 
form a nucleic acid:primer complex if the target sequence is present in a sample. An excess of 
deoxynucleoside triphosphates are also added to a reaction mixture along with a DNA 
polymerase, e.g., Tag polymerase, that facilitates template-dependent nucleic acid synthesis. 

If the marker sequence:primer complex has been formed, the polymerase will cause the 
primers to be extended along the marker sequence by the addition of nucleotides. By raising and 
lowering the temperature of the reaction mixture, the extended primers will dissociate from the 
marker to form reaction products, excess primers will bind to the marker and to the reaction 
products and the process is repeated. These multiple rounds of amplification, referred to as 
"cycles", are conducted until a sufficient amount of amplification product is produced. 

A reverse transcriptase PCR™ ("rtPCR™") amplification procedure may be performed in 
order to quantify the amount of mRNA amplified. Methods of reverse transcribing RNA into 
cDNA are well known and described in Sambrook et al., 1989. 
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Another method for amplification is the ligase chain reaction ("LCR"), disclosed in 
European Patent Application No. 320,308, incorporated herein by reference. In LCR, two 
complementary probe pairs are prepared, and in the presence of the target sequence, each pair 
will bind to opposite complementary strands of the target such that they abut. In the presence of 
a ligase, the two probe pairs will link to form a single unit. By temperature cycling, as in PCR™, 
bound ligated units dissociate from the target and then serve as "target sequences" for ligation of 
excess probe pairs. U.S. Patent 4,883,750, incorporated herein by reference, describes a method 
similar to LCR for binding probe pairs to a target sequence. 

Qbeta Replicase, described in PCT Patent Application No. PCT/US 8 7/00880, may also 
be used as still another amplification method in the present invention, in this method, a 
replicative sequence of RN A which has a region complementary to that of a target is added to a 
sample in the presence of an RNA polymerase. The polymerase will copy the replicative 
sequence. 

An isothermal amplification method, in which restriction endonucleases and ligases are 
used to achieve the amplification of target molecules that contain nucleotide 5'-[alpha-thio]- 
triphosphates in one strand of a restriction site may also be useful in the amplification of nucleic 
acids in the present invention. Such an amplification method is described by Walker et ai (1992, 
incorporated herein by reference). 

Strand Displacement Amplification ("SDA") is another method of carrying out 
isothermal amplification of nucleic acids which involves multiple rounds of strand displacement 
and synthesis. A similar method, called Repair Chain Reaction (RCR), involves annealing 
several probes throughout a region targeted for amplification, followed by a repair reaction in 
which only two of the four bases are present. The other two bases can be added as biotinylated 
derivatives for easy detection. A similar approach is used in SDA. 

Target specific sequences may also be generated using a cyclic probe reaction ("CPR"). 
In CPR, a probe having 3' and 5' sequences of non-specific DNA and a middle sequence of 
specific RNA is hybridized to DNA which is present in a sample. Upon hybridization, the 
reaction is treated with RNase H, and the products of the probe identified as distinctive products 
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which are released after digestion, 
and the reaction is repeated. 
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The original template is annealed to another cycling probe 



Other nucleic acid amplification procedures include transcription-based amplification 
systems ("TAS"), including nucleic acid sequence based amplification ("NASBA") and 3SR 
(Kwoh etal, 1989; PCT Patent Application WO 88/10315, each incorporated herein by 
reference). 

In NASBA, the nucleic acids may be prepared for amplification by standard 
phenol/chloroform extraction, heat denaturation of a clinical sample, treatment with lysis buffer 
and minispin columns for isolation of DNA and RNA or -guanidinium chloride extraction of 
RNA. These amplification techniques involve annealing a primer which has target specific 
sequences. Following polymerization, DNA/RNA hybrids are digested with RNase H while 
double stranded DNA molecules are heat denatured again. In either case the single stranded 
DNA is made fully double stranded by addition of second target specific primer, followed by 
polymerization. The double-stranded DNA molecules are then multiply transcribed by a 
polymerase such as T7 or SP6. In an isothermal cyclic reaction, the RNA's are reverse 
transcribed into double stranded DNA, and transcribed once again with a polymerase such as T7 
or SP6. The resulting products, whether truncated or complete, indicate target specific 
sequences. 



European Patent Application No. 329,822 (incorporated herein by reference) disclose a 
nucleic acid amplification process involving cyclically synthesizing single-stranded RNA 
("ssRNA"), single-stranded DNA ("ssDNA"), and double-stranded DNA ("dsDNA"), which may 
be used in accordance with the present invention. 

Following amplification, it may be desirable to separate the amplification product from 
the template and the excess primer for the purpose of determining whether specific amplification 
has occurred. In one embodiment, amplification products are separated by agarose, agarose- 
acrylamide- or polyacrylamide gel electrophoresis using standard methods (Sambrook et al, 
1989). 
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Alternatively, chromatographic techniques may be employed to effect separation. There 
are many kinds of chromatography which may be used in the present invention: adsorption, 
partition, ion-exchange and molecular sieve, and many specialized techniques for using them 
including column, paper, thin-layer and gas chromatography (Freifelder, 1982). 

Separation may also be achieved using biologically based interactions such as biotin- 
streptavidin or antibody-antigen interactions. 

In embodiments where the mass labels have been incorporated into the product, detection 
of the mass labels may be used to confirm amplification. When the mass label is to be added 
later, amplification products should, typically be visualized in order to confirm amplification of 
the sequences. One typical visualization method involves staining of a gei with ethidium 
bromide and visualization under UV light. . Alternatively, if the amplification products are 
integrally labeled with radio- or fluorometrically-labeled nucleotides, the amplification products 
may typically be exposed to x-ray film or visualized under the appropriate stimulating spectra, 
following separation. 

G. Chemical Synthesis Techniques 

If the probe is chemically synthesized, the mass label may be placed at one or more 
locations within the reactive group. For example, polypeptide compounds of the present 
invention may be synthesized using known methods for peptide synthesis (Atherton & Shepard, 
1989). The preferred method for synthesis is standard solid phase methodology, such as that 
based on the 9-fluorenylmethyloxycarbonyl ("FMOC") protecting group (Barlos et ai, 1989), 
with glycine-functionalized o-chlorotrityl polystyrene resin. Solid phase peptide synthesis 
allows for strategic placement of a mass label within the compound. Similarly, an 
oligonucleotide probe, for example, may be specifically labeled by introducing a modified mass- 
labeled phosphoramidite at a particular location within the sequence. Chemical synthesis 
methods also permit the placement of mass labels at the termini of the probe or within an internal 
linker wherein the mass label is not directly attached to the base of a nucleotide. A generalized 
example of a mass-labeled phosphoramidite is shown in FIG. IB. Chemical synthesis methods 
for DNA are well known within the art (Agrawal, 1993) 
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The use of combinations of different mass labels can greatly enlarge the number of 
unique mass signatures that are available when making a library of nucleic acid probes, while 
needing only a modest set of different mass label components. As an example, using 
polymerase-based methods and a repertoire of 40 different mass-labeled thymidine triphosphate 
nucleotides each with a unique mass label, one may synthesize an enormous array of 
differentially labeled probes. If combinations of two different mass labels out of the 40 are used 
for each probe then a total of 780 probes may be made each with a unique, two-mass signature [= 
40!/(2!.38!) = 780]. If three different labels are used per probe then 9,880 different combinations 
are possible [= 40!/(3!.37!) = 9,880]. The trend continues using the example of combination of 
sets of mass labels from a pool of 40 label molecules as follows: a set of four . labels yields 
91,390 possible combinations, five labels yields 658,008 possible -combinations, six labels yields 
3,838,380 possible combinations and so on. Conceivably probes may be made with a unique 
mass label signature for every gene within humans, and any other organism for that matter. 
Examples of enzymatic probe synthesis are shown in FIG. 4C and FIG. 4D. 

An alternative to the use of mixtures of mass-labeled nucleotides, is the use of mixtures 
of mass-labeled primers. Nucleic acid probes prepared by an amplification method, such as 
PCR™, may utilize mixtures of primers whereby each primer contains a different mass label and 
the same DNA sequence. As with the mass-labeled nucleoside triphosphates, a repertoire of 
mass labeled primers may be used to prepare many different mass signatures. In addition to 
using mixtures of primers with a single type of mass label, primers may be prepared containing 
several different mass labels within a single molecule. 

A particular advantage to the solid phase method of synthesis is the modification of these 
compounds using combinatorial synthesis techniques. Combinatorial synthesis techniques are 
defined as those techniques producing large collections or libraries of compounds 
simultaneously, by sequentially linking different building blocks. Libraries can be constructed 
using compounds free in solution, but preferably the compound is linked to a solid support such 
as a bead, solid particle or even displayed on the surface of a microorganism. Several methods 
exist for combinatorial synthesis (Holmes et al, 1995; Burbaum etal, 1995; Martin etal, 1995; 
Freier etal., 1995; Pei etai, 1991; Bruce etal, 1995; Ohlmeyer et ai 9 1993); including split 
synthesis or parallel synthesis. Split synthesis may be used to produce small amounts of a 
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relatively large number of compounds, while parallel synthesis may produce larger amounts of a 
relatively small number of compounds. In general terms, using split synthesis, compounds are 
synthesized on the surface of a microparticle. At each step, the particles are partitioned into 
several groups for the addition of the next component. The different groups are then recombined 
and partitioned to form new groups. The process is repeated until the compound is completed. 
Each particle holds several copies of the same compound allowing for facile separation and 
purification. Split synthesis can only be conducted using a solid support. 

An alternative technique known as parallel synthesis may be conducted either in solid 
phase or solution. Using parallel synthesis, different compounds are synthesized in separate 
receptacles, often using automation. Parallel synthesis may be conducted in microtiter plate 
where different reagents can be added to each well in a predefined manner to produce a 
combinatorial library. Parallel synthesis is the preferred approach for use with enzymatic 
techniques. It is well understood that many modifications of this technique exist and can be 
adapted for use with the present invention. Using combinatorial methods, a large number of 
unique mass-labeled probes may be synthesized. 

One embodiment is an approach to synthesizing all possible combinations of sequence 
simultaneously in such a way that each unique sequence within the pool will possess a unique 
mass signature. The synthetic approach involves the use of a unique set of four mass-labeled 
nucleotides for each position within an oligonucleotide probe, i.e., a set of four mass labels are 
used exclusively at position 1, while a different set of four is used exclusively at position 2, and 
so on. The primary method of synthesizing said probes is chemical using phosphoramidite 
chemistry though other chemical and enzymatic methods including single base addition by 
polymerase may also be employed. As an example, synthesis of the combinatorial set of all 
oligonucleotides 10 bases long would require 40 different phosphoramidites, 10 different A's 
with unique mass labels, 10 different C's with unique mass labels, 10 different G's with unique 
mass-labels, and 10 different T's with unique mass labels. The scheme is illustrated in FIG. 4A. 

Utility for the complete probe set is diverse. Applications include hybridization assays 
for identity of cDNAs of other sequences present in a solid phase bound array or some other 
format, mapping applications, and other diagnostic applications. It is also possible to use the set 
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for random PCR™ amplification assays where the products are separated by electrophoresis and 
the primers that paired to form the different PCR™ products are identified. These applications 
also apply to the methods used to identify short sequence reads. 

The combinatorial synthesis of probes can be performed as a single reaction in a single 
receptacle, or it may be performed using the split synthesis technique previously described. If 
the combinatorial synthesis does not utilize split synthesis techniques, there may be difficulties 
identifying sequence in cases where multiple probes hybridize. In cases where the full set of 
probes are used it may be difficult to uniquely identify the sequences of the probes if more than 
one probe is present at a significant level. One possible approach to limiting the number of 
probes that hybridize to a particular target is by attaching a unique ■anchoring sequence to the 
probe set limiting the locations where the probe can hybridize. This anchoring is similar to the 
methods used to identify short sequence reads. As described previously, it may also be possible 
to add extra bases to the end of the . probe to lengthen the sequence determination and improve 
discrimination, if necessary. 

A specific example of using the anchored, combinatorially synthesized probes is shown 
in FIG. 4B. In the case of screening genomic or cDNA clone inserts, the anchored, invariant 
sequence may be used to hybridize to the know vector sequence immediately adjacent to the 
insert or in the specific case of a cDNA insert to the poly A/T region of the insert. 

For addition of labels to an already synthesized probe, herein referred to as post- 
modification, various chemically active sites on the probe may be utilized. For example, a 
proper functionality of a label could be reacted with a primary amine on 5 propargyl amino 
deoxyuridine, a terminal amino or carboxyl linker, or an endogenous moiety, such as the 
exocyclic amine in cytosine, guanine, or adenine. Potential linker groups include the 
heteobifunctional cross-linking agent mal-sac-HNSA (Bachem Inc., Torrence, CA), or any of a 
variety of cross-linking agents available from Pierce Chemical Company (Rockford, IL). One 
skilled in the art could in light of the present disclosure supply other examples. Post 
modification also allows for the addition of multiple mass labels. 
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I. Assays with nonvolatile, releasable mass-labeled probes 

The described mass-labeled nucleic acid probes have a variety of uses. Labeled 
polypeptides may be used to detect interaction of a reactive group with a specific target. 
Representative examples include a mass-labeled antibody to detect an antigen either in solution 
or on a solid support or a mass-labeled enzyme to detect a substrate. One of skill in the art would 
recognize there are many such interactions detectable using labeled polypeptides to detect 
interactions with a target molecule. 

One preferred embodiment of the invention relates to the simple detection of a specific 
target nucleic acid. 

There are a variety of reasons for detecting a particular nucleic acid sequence. These 
reasons include, but are not limited to, detection of infectious agents within a clinical sample, 
detection of an amplification product derived from genomic DNA or RNA or message RNA, or 
detection of a gene (cDNA) insert within a clone. Simple detection may employ any 
combination of the methods described herein for the preparation of the nucleic acid probe and the 
release and detection of the mass label. One may also quantify the amount detected. Most of 
these methods involve the use of a hybridization-specific event to trigger the release of a mass 
label, and in cases where only small amounts of target material are present, the use of an 
amplification technique. 

An advantage to using mass-labeled compounds that are detectable by mass spectrometry 
methods is the ability to simultaneously detect many target compounds at the same time. Due to 
broad overlapping spectrums produced by existing fluorescent chromophores, an upper limit for 
fluorescence multiplexing is most likely to be about ten different labels. With a matrix-assisted 
laser desorption/ionization time-of-flight ("MALDI-TOF") mass spectrometer or direct laser- 
desorption mass spectrometer or an electrospray mass spectrometer, multiplexing of tens of 
hundreds and perhaps even thousands of different mass labels is possible. A nonvolatile pool of 
labels may provide a wider range of masses and structures. Due to this multiplexing ability, not 
only can many labeled probes be used at the same time, any individual probe can be labeled with 
many different labels. 
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J. Single Nucleotide Polymorphism Detection 

Further embodiments involve the detection of single base variations. These applications 
will generally require a great deal of sensitivity. These applications include detection of "hot 
spot" point mutations and identification of the base at single nucleotide polymorphism ("SNP") 
sites. Mass-labeled probes may be prepared that hybridize immediately adjacent to a 
polymorphic site and a polymerase may then be used to add one base at the site of the 
polymorphism. The particular base may be added to the probe by many ways. For example, in a 
preferred embodiment where a single probe is used, a mixture of the four chain terminating 
triphosphates may be added, each with a unique mass label attached. In the homozygous SNP 
case only one of the four chain-terminating nucleotides may add to the end of the probe coupling 
the associated mass label to the probe. Several approaches may be taken in releasing the mass 
label from the probe. These approaches include, but are not limited to, the use of chemically 
labile functional groups linking the mass label to the terminating nucleotide, chemically labile 
functional groups within the backbone of the extended primer or the chain-termination 
nucleotide, or the use of an enzyme to cleave at one or more of the phosphodiester or glycosidic 
linkages within the primer extension product. In cases where the mass label release point is 
within the backbone of the extension product, the released mass label may include the terminal 
nucleotide or some mass-modified version thereof. In another version where the release point is 
internal to the primer extension product, the native chain-terminating nucleotides themselves 
may serve as all or a portion of the mass labels since each base possesses a unique mass. In 
cases where the mass label is chemically cleaved from the probe, any unincorporated nucleotides 
may first be removed or washed away so that they are not visualized by the mass spectrometer. 

Partitioning of the hybridized mass-labeled chain-terminating triphosphate may be done 
on the basis of mass differences, as labeled triphosphate hybridized to a target-hybridized probe 
will have a higher molecular weight than a labeled triphosphate that is not. The probe or target 
may also be attached to a solid-phase via a number of means including biotin/streptavidin or 
chemical coupling or UV cross-linking. An alternative is the use of a nuclease to digest the 
mass-labeled probe. Using a nuclease the mass-labeled chain-terminating nucleotide will be 
released as a monophosphate. The unincorporated mass-labeled chain-terminating nucleotides 
will remain as triphosphates, and the resulting mass shift to monophosphate will indicate which 
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nucleotide was incorporated. This nuclease method relieves the necessity to remove 
unincorporated nucleotides prior to analysis. 

Another embodiment encompasses the multiplexing of a large number of probes so as to 
detect many SNPs simultaneously. Preferably mass labels may be present to uniquely tag each 
of the probes that comprise the pool. The addition of a biotinylated chain-terminating nucleotide 
at the site of the point polymorphism may also be used to segregate the probe population 
depending on which probes incorporate a specific biotinylated chain-terminating nucleotide and 
which do not. As an example, the pool of mass-labeled probes with target may be divided into 
four reactions. The first reaction would contain only biotinylated dideoxy adenosine 
triphosphate, the second Would contain only biotinylated dideoxy cytidine triphosphate, the third 
only biotinylated dideoxy guanidine triphosphate, and the fourth only ■ biotinylated dideoxy 
thymidine triphosphate. Following a single base extension polymerase-dependent reaction in the 
presence of the proper nucleotide, the extended products are captured, washed and the mass 
labels are released for mass spectrometric analysis. In the first reaction only those mass-labeled 
probes that incorporate an A will be visualized. In the second reaction only those mass-labeled 
probes that incorporated a C will be visualized. For the third and fourth reactions probes that 
incorporated, respectively a G or a T will be visualized. It is expected that hundreds of probes 
could be multiplexed in this way. 

A person skilled in the art could identify a number of variations of the single or 
multiplexed probe approach for reading out the SNP based on either the absence or appearance of 
the mass label or mass change occurring in the mass label: Another example of mass change 
within a mass label is the case where the mass label is present at the 3' end of the probe. 
Following polymerase-dependent base extension, the mass label may be released, including the 
chain terminating base addition as well as the penultimate base. A possible structure for this type 
of probe is shown in FIG. 2. Placement of the mass label and the release site may be at other 
bases with a preference of placement near the 3' end. In all cases the mass label should 
preferably be placed between the release group and the 3' end. In other embodiments it may be 
preferred to perform what is effectively a short chain terminated sequencing reaction, where, in " 
addition to dideoxy nucleotides, some amount of normal deoxy nucleotides are present. 
Extension of the primer will result in a nested set of products, each being chain terminated by a 
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dideoxynucleotide correlating to its complementary base on the template strand. In the preferred 
form, the mass label may be located within the primer near the 3' end which contains a chemical 
release group. Such a method offers a separate embodiment for short sequence reads as well as 
detection of one or more SNPs. All of the SNP detection methods described above may involve 
the use of mass modified forms of the different nucleotides in order to enhance the mass 
difference between the different possible products. 

An alternative preferred embodiment to single base addition for detecting an SNP is the 
performance of a discriminating exonuclease event in the presence of matching and mismatching 
oligonucleotide probes. One example of this approach is to combine the use of releaseable mass 
labels with nick translation PGR™, in addition to its polymerase activity, Taq DNA polymerase 
has both 5' to 3' exonuclease and endonuclease activities. If a fully complementary 
oligonucleotide probe is placed in the path of polymerization, for example during PCR™ 
amplification, the polymerase will attack the 5* end of the probe with its exonuclease activity, 
digesting the molecule until it is too small to remain hybridized. However, if the oligonucleotide 
is not perfectly complementary near the 5' end, e.g., a mismatch is present nearby, then the end 
of the probe will fray and be attacked by the endonucleolytic activity of the polymerase rather 
than the exonuclease activity. The nucleolytically cleaved product, preferably containing the 
mass label, will have a different final mass depending on whether or not a mismatch was present 
and how the nuclease cut in response to this mismatch. It has been demonstrated that the 
initiation of endonucleolytic activity can be influenced by the presence and placement of a 
mismatch within the hybridization probe (Holland et al, 1991; Lee et aL, 1993). Selective 
placement of a mass label within the oligonucleotide probe relative to the expected mismatch site 
can be used to yield a differential signal depending on whether or not an actual mismatch is 
present. 

By taking advantage of the high multiplexing capability of mass-labeled probes, one can 
extend this assay to the simultaneous detection of multiple SNPs. Each of the probes targeting a 
particular SNP contains one of the four possible bases to complement the site of polymorphism. 
The placement of the mass label is such that if the probe contains a perfect match to the template, 
the mass label will be released by the exonuclease activity of Taq polymerase, primarily in a 
form that includes a single nucleotide. The other probes will create a mismatch and the 
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endonuclease activity of the polymerase will initiate cutting of the probe in such a way that the 
mass label remains bound to a larger segment of the probe that includes more than one 
nucleotide. The shift in mass of the mass label cleavage product is diagnostic of whether or not a 
mismatch has occurred. 



When the detection by mass spectrometry is performed using MALDI it may be possible 
to select a matrix that can visibly discriminate between the smaller product that results from the 
matching probe and the larger product that results from the mismatched probes such that the 
smaller product is desorbed more efficiently or selectively. Utilizing a matrix such as 2,5- 
dihydroxybenzoic acid s sinapinic acid, or a-cyano-4-hydroxycinammic acid, the signal strength 
decreases as more nucleotides are attached to the probe (Jensen, et ai., 1996). 

By using a set of 50 mass-labeled probes, as many as 25 biallelic SNPs may be detected 
in a single tube. As is the case with any PCR™ based detection scheme, the limit of SNPs to be 
detected will more likely be the result of the limits of multiplexing PCR™. The process, when 
coupled to high throughput mass spectrometric analysis, can be especially cost efficient when 
analyzing a small set of polymorphic sites, e.g., in a cluster of exons, as part of a population 
study where thousands to tens of thousands of samples need to be analyzed. 

Nick translation PCR™ combined with mass-labeled probes can also be used as a 
generalized method for the detection and monitoring of a PCR™ amplification reaction. In this 
case, only matching probes are present and the mass label is released, only if PCR™ of the 
particular region targeted by a particular probe is amplified. 

While the preferred embodiment for these assays is to use nonvolatile releasable mass 
labels or involatile releasable mass labels, other types of labels can be used as well, such as 
isotopic mass labels, volatile mass labels (including electrophores), fluorescent labels, and 
chemiluminescent labels. 

K. Short Sequence Reads 

In another preferred embodiment of the invention, the mass-labeled probes may be used 
to identify short sequences. In particular, combinations of hybridization and enzymatic 
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(polymerase or ligase) extension can be employed with the labeled probes to identify short 
sequence runs adjacent to a "priming" or anchoring region. There are three optimal methods for 
doing this. The first method is illustrated in FIG. 3A. A mixture of probes are synthesized 
containing two domains, a fixed sequence recognition domain, typically comprised of only one 
or a few sequences, and a randomized domain, comprising the full set (or some subset) of all 
possible sequences. The" fixed sequence of the probe is used to target hybridization of the probe 
to a single site within a particular target nucleic acid. This target site is typically invariant. The 
sequence adjacent to the invariant sequence is variable and, depending on the particular target, 
can have any one of the total combinations of sequence. In order to probe for all possibilities it 
is necessary to synthesize probes containing all the possible secondary domain sequence 
combinations. If the second probe region is four bases in length, then 256 different pjfobes need 
to be synthesized. If the second probe region is five bases in length, then 1024 different probes 
need to be synthesized. Six bases requires 4096, and so on. The probes can be synthesized . 
individually, each possessing a unique combination of mass labels as a releasable mass signature. 
Alternatively, the probes can be synthesized with unique mass signatures using a combinatorial 
synthesis method of the type described previously. In particular embodiments regarding 
diagnostic probes, it may be desirable to generate only a small number of probes, for example 
less than 20. 

The two domain probes are useful for identifying the end sequence within clone inserts. 
As an example, the fixed sequence domain would hybridize to the cloning vector sequence 
immediately adjacent to the insert sequence, The variable sequence is then available to hybridize 
to the cloned insert. Only the probe that is complementary to the cloned insert sequence adjacent 
to the cloning vector sequence will form a perfect hybrid. The remaining two domain probes 
will not. Detection of the mass label signature for the probe that has hybridized using one of the 
methods described will identify the probe sequence and the clone insert sequence. Other 
applications include targeting hypervariable sequence regions or mutation/polymorphism 
analysis at targeted sites. In all cases the fixed sequence of the probe directs the probe to a 
unique region within the target, essentially anchoring where the variable region will probe. 

In order to increase the level of discrimination and extend the read length for the short 
sequence read it is possible to use an enzyme, such as polymerase or ligase, to , add a single 
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nucleotide or oligonucleotide to the end of the variable region of the anchored probe, optionally 
including mass labels on the added nucleotide or oligonucleotide that can identify the sequence 
for these additions. Addition of bases by either enzyme places stricter requirements on the 
variable region being a perfect hybrid to enable enzymatic action. Examples of how these probe 
additions work are shown in FIG. 3B. Note that for polymerase the addition needs to be to the 3' 
end of the probe while ligation can occur at either the 3' end or 5' end. As with the variable 
region within the probe increasing size of the addition wiil necessitate a larger and larger pool to 
represent all possible sequences. Oligonucleotide additions don't necessarily need to be entirely 
variable. There may be cases where the variable region will contain an invariant region. Such 
extensions will increase . the thermodynamic stability of the oligonucleotide addition and allow 
iigatibn to occur at higher temperatures. -It is also possible to envision cases where invariant 
nucleotide sequence would be intermingled with the variable sequences described. 

Combinatorial libraries may also be used to detect short sequences. In cases where the 
full set of probes are used, though, it may not be possible to uniquely identify the sequences of 
the probes if more than one probe is present after hybridization at a significant level. One 
possible approach to limiting the number of probes that hybridize to a particular target is by 
attaching a unique anchoring sequence to the probe set limiting the locations where the probe can 
hybridize. This anchoring is similar to that previously described for analysis of short sequence 
reads. As previously described, it is also possible that extra bases could be added to the end of 
the probe to lengthen the sequence determination and improve discrimination, if necessary. 

A specific example of using the anchored, combinatorial ly synthesized probes is shown 
in FIG. 4B. In the case of screening genomic or cDNA clone inserts the anchored, invariant 
sequence is used to hybridize to the known vector sequence immediately adjacent to the insert or 
in the specific case of a cDNA insert to the poly A/T region of the insert. 

While the preferred embodiment for these assays is to use nonvolatile releasable mass 
labels or involatile releasable mass labels, other types of labels can be used as well, such as 
isotopic mass labels, volatile mass labels (including electrophores), fluorescent labels, and 
chemiluminescent labels. 
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L. Targeted Cleavage Mismatch Detection 

It is of interest to detect the presence of a mutation within a given sequence in cases 
where one. does not have prior knowledge of exactly where the particular mutation might occur. 
Oligonucleotide probes may be used for hybridization to a target DNA containing a single 
mutation within a region of interest, leading to the formation of a mismatch. In one embodiment 
of the invention, enzymatically synthesized mass-labeled probes blocked from double-strand- 
specific enzymatic digestion at the 3' end are used. The 3' ends of the probes can be blocked by 
chemical modification or enzymatically. For example, blocking can be achieved by making the 
3' terminus inaccessible to enzymatic digestion. After hybridization of the probe to the target 
sequence, treatment with a mismatch specific chemical or enzymatic cleaving reagent would 
cleave the hybridized pair at the mismatch site. Representative cleaving -reagents include 
KMn0 4 and T4 endonuclease VII. Subsequent treatment of the cleaved pair with a double- 
strand-specific 3'-5' exonuclease, such as exonuclease III, would lead to digestion of probe from 
the cleavage site to the 5' labeled end, thereby releasing the mass label. This method is 
illustrated in FIG. 5 A and FIG. 5B. As an alternative, the polarity of the system can be reversed 
by placement of the mass label at the 3' end of the probe and by using a double-strand-specific 
5'-3' exonuclease, such as T7 gene 6 exonuclease. 

Another example of mismatch detection involves the amplification of heterozygous target 
DNA using two different mass-labeled probes. The difference can be a single base mutation, for 
example A:T to G:C. Four products are produced by the PCR™ reaction, two fully homogenous 
products representing the original sequences, while the other two products contain a mismatch at 
the mutation site. Treatment with terminal transferase adds long 3' overhangs to all of the 
products. Chemical or enzymatic mismatch specific cleavage is used, affecting only the two 
heterogeneous pairs. Exonuclease III digestion also affects only the cleaved heterogeneous pairs, 
releasing the mass labels without digesting the sequences blocked by the 3' overhangs. This 
method is shown is FIG. 5C and FIG. 5D. These mismatch methods could also be combined 
with other labeling methods such as fluorescent tags or radiolabels. 

While the preferred embodiment for these assays is to use nonvolatile releasable mass 
labels or involatile releasable mass labels, other types of labeis can be used as well, such as 
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isotopic mass labels, volatile mass labels (including electrophores), fluorescent labels, and 
chemiluminescent labels. 



M. Highly multiplexed probe screening assays 

A number of novel applications become possible with multiplexed, mass-labeled probes 
where the preferred mode is to be able to screen a large number of targets simultaneously. 
Multiplexed applications include multiple pathogen diagnostics, multigene genetic 
polymorphism screening, SNP genotyping, clone and gene mapping, and gene expression 
analysis. 

Highly multiplexed analysis by hybridization can be categorized into one of three 
approaches;' (A) hybridization of a library of probes with known sequence against a library of 
targets of unknown sequence, (B) hybridization of a library of probes with unknown sequence 
against a library of targets of known sequence, and (C) hybridization of a library of probes with 
unknown sequence against a library of targets of unknown sequence. 

Approach (A) is beneficial for applications such as diagnostics, genotyping, expression 
analysis and probe mapping where it has been predetermined what sequences are to be screened. 
Many of the methods described above may be used in approach (A). Combinatorially 
synthesized probes can be used with approach (A) where the sequences of the probes (and target 
to which the probe is hybridized) are postdetermined, i.e. probe and then determine the sequence 
of which probe has hybridized. The limits as previously described for combinatorial probes 
apply. Use of repertoire sets of mass labeled probes, as opposed to combinatorial probes, can be 
used in multiplexed mixtures to detect .the presence of short sequences for purposes of 
sequencing by hybridization or producing a probe signature for a particular target sequence. 

Approach (B) provides a path for a number of applications where a library of different 
known DNA sequences, such as oligonucleotides, PCR™ products, RNA transcripts or DNA 
clones, have been arranged and are available for partitioning the unknown probe set. These 
methods often, but not always, include the use of solid phase arrays to physically partition the 
known sequences prior to probing. Applications include competitive hybridization for 
differential expression analysis and fast mapping of genes, subclones or short sequence tags 
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(SSTs) against a master genomic clone library, multiplexed infectious agent detection or any 
other set of samples that need to be probed in a multiplexed fashion. 

Approach (C) is useful in cases where it is not necessary to know sequence but only to 
5 determine trends. As an example, one might want to determine the degree of homology or 
complementarity between two or more species or two or more expressed gene sets. Random or 
semirandom probes against random or semi random target can provide percentage values for 
homology. In these cases probes or targets that exhibit different properties, e.g., fall into the 
nonhomologous category, may be taken on for further analysis to determine their sequences. 
1 0 Such a method could be used for gene discovery. 

A practical example employing these three approaches is in measuring gene expression 
profiles. The most basic way to measure a gene expression profile is statistically, to count the 
number of message RNAs (mRNAs) produced for each particular gene within a particular 

1 5 cellular sample. The more mRNA copies of a particular gene, the higher its level of expression. 
The approach commonly taken is to separate out a representative number of mRNAs through a 
process of copying the mRNA to complementary DNA (cDNA), and then growing up the 
individual clone colonies of each cDNA on culture plates. Typically, cDNAs are cloned by 
insertion into either a plasmid or a phagemid cloning vector, and then transformed into bacteria 

20 or encapsidated into phage respectively. Each clone represents an individual mRNA derived 
from the total population. The set of clones comprises a gene expression library. 

Currently, the common approach used in genomic research to screen the. clones and to 
identify which mRNA/gene correlates to which clone is to sequence the DNA. A portion of each 
25 cDNA clone sequence is read creating an expressed sequence tag (EST) that uniquely identifies 
the message/gene sequence. Identity is made by comparing the EST to genomic data bases 
containing previously identified gene sequences. In several years, all human EST sequences will 
be placed into existing public and private databases. 

30 When screening a particular clone library, possibly a library that includes 10,000 clones, 

any particular EST may appear multiple times. The more times a particular EST appears, the 
higher the expression level for the gene correlating to the EST. The more clones that can be 
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read, the more statistically representative the EST data will be to actual expression. Screening 
larger numbers of clones also makes it more likely that genes expressed at low levels will be 
identified. 

With this in mind, it would be ideal to be able to screen 100,000 or more clones per 
library. However, this level is costly and impractical using existing sequencing technology. 
Typical sequencing screens analyze 500-10,000 samples at a cost of $5,000 to $100,000. New 
DNA sequencing technology will be able to lower this cost somewhat. 

The mass-labeled hybridization probes of the present invention could simplify and lower 
the cost of gene expression analysis. The probe approach - primarily utilizes knowledge of the 
genes to be analyzed. Since the vast majority of gene sequences will be known within a few 
years, it is not necessary to use a de novo technique. It is also possible to detect previously 
unknown genes with these hybridization procedures. Complete identification of new genes may 
require a separate DNA sequencing analysis, subsequent to a hybridization assay, to determine 
the sequence of any of these newly discovered genes. 

As is the case for the sequencing-based approach to gene expression analysis, the 
hybridization approaches of the current invention will usually involve converting the mRNA 
population to cDNA, transforming the cDNA into bacteria and growing bacterial colonies on 
culture plates and screening bacterially derived plasmids. Following the process of approach 
(A), hybridization of a library of known probes against a library of unknown targets (the cDNA 
clones), the clones to be screened can be spotted in a regularly spaced array or grid on a surface 
such as a nylon filter, glass, silicon or gold. The typical process involving bacteria colonies 
involves lysing the bacteria cells on the grid and fixing the DNA to the surface. The grid of 
cDNAs represent the library of tens to hundreds of thousands of expressed messages to be 
probed. 

In conventional methods, a grid can be probed with only one single probe sequence at a 
time, typically being radioactively labeled as shown in FIG. 9A. Following the gridding of the 
unknown cDNAs, the library cDNA array is wetted with a solution containing the labeled nucleic 
acid probe. The grid-probe solution is incubated to allow the probe to hybridize its complement 
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at one or more positions within the grid. Following hybridization, the grid is imaged in order to 
locate the probe-hybridization positions. In order to use multiple probes representing multiple 
genes, the grid needs to be replicated and a different grid is used for each probe. Using 
fluorescent labels, four different chrornophores can be multiplexed within a sample and 
individually detected with the aid of software deconvolution of the fluorescence emission 
spectrum as shown in FIG. 9B. However, the practical upper limit for fluorescence multiplexing 
is likely to be around 10 different labels due to the broad overlapping spectrum produced by 
existing fluorescent chrornophores. 

Use of releasable, nonvolatile mass labels to uniquely label individual probes provides a 
means of using a highly multiplexed set of probes to -simultaneously screen a single grid of 
unknowns. The nucleic acid probes can be synthesized using individual cDNAs with known 
sequence as templates. In all cases the probes may use combinations of mass labels or single 
mass labels. Following synthesis and mass-labeling, the different probes can be combined and 
used to probe a single grid in a multiplex fashion. The probing procedure is identical to that used 
for a single radioactively labeled probe until the imaging step is reached. Instead of using a 
phosphorimager or x-ray film, the grid is scanned within the mass spectrometer after release of 
the labels, pausing briefly at each position to detect the mass label signal that may be present. 

The number of probes used is only limited to the number of probes one is willing to make 
and to the number one is interested in. As an example, one may be interested in a set of 1000 
genes that may play an important role in a particular disease or one may wish to look at 50,000 
different genes. In either case the probes may be individually synthesized or produced in 
combinations in microtiter plates using liquid handling robotics. Likely approaches include the 
performance of T7 RNA polymerase transcriptions of plasmids containing known cDNA inserts 
using mass-labeled nucleoside triphosphates to produce mass-labeled RNA probes, PCR™ 
reactions amplifying known cDNA inserts using either mass-labeled nucleoside triphosphates or 
mass-labeled DNA primers to produce mass-labeled DNA probes, or chemically synthesized 
mass-labeled oligonucleotide probes. Examples of enzymatic probe synthesis are provided in 
FIG. 4C and FIG. 4D. Within each synthesis reaction a different single or unique combination of 
mass-labeled nucleoside triphosphates are added which thereby incorporate a unique mass 
signature within each newly synthesized probe. In . the cases of mass-labeled oligonucleotide 
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probes it is also possible to use chemically synthesized combinatorial probes. Following 
synthesis, the probe set is mixed together to create a master probe mix. A number of master 
probe mixes can be prepared to perform multiplexing if desired, where each cDNA of each 
master probe mix has a unique combination mass label signature. The probe set or sets can then 
be used to probe a large number of different unknown complementary DNA gridded libraries as 
shown in FIG. 9C. Different libraries can be prepared from a variety of samples, for example 
exposed to different stressor conditions and/or different test pharmaceuticals, possibly with time 
as an additional variable. 

An alternative method for gene expression analysis follows the process of approach (B), 
hybridization of a library of unknown probes against a -library of known targets sequences. 
Rather than uniquely labeling known gene probes to hybridize against unknown cDNAs, one can 
label libraries of unknown cDNAs and hybridize against known unlabeled gene probes arrayed 
on a grid. This method has been described for two libraries using fluorescently labeled unknown 
cDNA mixtures (Schena et ai, 1995; incorporated herein by reference) as shown in FIG. 10A. 
In the fluorescent case, first strand cDNA is prepared from two separate cellular samples. 
Synthesis of the first mixture of cDNAs is performed in presence of one particular fluorescent 
nucleotide, and the synthesis of the second mixture in the presence of a different fluorescent 
nucleotide. The mixtures of cDNAs, which reflect the relative abundance of different mRNAs 
from each sample, are then mixed and allowed to competitively hybridize to a gridded array of 
known genes present on a solid phase surface. After the cDNAs have hybridized to the grid, and 
unbound labeled cDNAs are washed away, the relative fluorescence intensity for the two dyes is 
measured at each position in the gridded array. If the fluorescence intensity for each dye is 
equivalent then the corresponding mRNAs from each sample were expressed at a similar level. 
If the fluorescence intensity is stronger for one dye than the other at a particular position/gene in 
the gridded array, then that gene was expressed at a higher level in the sample whose 
fluorescence was stronger. 

By utilizing the mass labeling methods to prepare the cDNAs, rather than fluorescence, it 
is possible to prepare and simultaneously hybridize cDNAs from many different cellular sources 
to the gridded array of known genes. Instead of only two or three cDNA pools being compared 
simultaneously, the use of mass labels makes it possible to compare tens if not hundreds of 
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cDNA pools simultaneously as shown in FIG. 10B. The mass labels can be released by any of 
the appropriate release mechanisms described and the grid can be scanned for the mass label 
signal. The intensity of the mass signals at a given grid position will be proportional to the level 
of mRNA in the original sample that corresponds to the detected cDNA on the grid. The relative 
ratios of the competing mass labels are determined providing information about the differences 
in gene expression between all of the different samples for all of the genes present on the gridded 
array. 



This same multiplexed mass-labeled probe methodology can be used to quickly map 
genes to large genomic libraries. Gridded libraries of PI, PAC/BAC and YAC clones can be 
prepared in the same manner as cDNA filters. Multiple label studies provide a means -for ■quickly 
mapping genes and identifying gene clusters. Probes generated from particular clone inserts or 
gene sequences are used to screen libraries of genomic or cDNA clones. Hybridization events 
indicate an overlap of insert sequence in the genomic case and the presence of a gene in the 
cDNA case. These libraries can also be used for intergenomic probing, e.g., probing a C. elegans 
library with human gene probes,' and visa versa. 

The technology for probing with and detecting mass labels within gridded arrays can also 
be applied to other solid phase systems where DNA probes are utilized, specifically Northern and 
Southern assays. In these two methods the initial phase is to run a polyacrylamide gel and then 
to transfer the DNA to a nylon membrane using a blotting procedure (Sambrook et al 1989). As 
with other procedures described above, mass-labeled nucleic acid probes can be prepared to 
hybridize to the filters. In another embodiment mixtures of single or combinations of mass 
labels can be used in an effort to multiplex the detection. A scan of the filter after hybridization 
and washing within the mass spectrometer provides the means to detect, and where necessary 
quantify, the amount of mass label present in a particular location. 

An additional embodiment of the technology is the use of mass labeled protein probes, in 
the form of antibodies, for hybridization against one and two-dimensional protein gels. One 
skilled in the art can also envision other combinations of mass labeled probe molecules 
hybridized against targets bound to a solid phase matrix. In all cases the mass label is released 
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and either the solid phase surface analyzed using a scanning mass spectrometer, or a transfer to 
another surface takes place before mass analysis. 

Attachment of the genetic target or other target to a filter or other form of grid is not 
necessary as part of the broadest embodiments of the invention. For example, a mass-labeled 
probe set may be directly hybridized to DNA or RNA targets in solution. In order to 
discriminate between the probes that hybridize and the probes that do not, one of two possible 
events needs to occur. Either the mass labels on hybridized probes need to be enzymatically 
released using a double-strand-specific nuclease, such as exonuclease III, lambda exonuclease, 
T7 gene 6 exonuclease or a restriction endonuclease, or some partitioning event needs to occur 
wherein unhybridized probes are separated from hybridized probes. One of skill in the art can 
envision several means for partitioning other than pre-binding of the target to a solid phase array 
as described in the methods above, such as hybridized probe extension by a polymerase using 
biotinylated nucleotides, or coupling the mass labeled probe to a biotinylated probe as part of an 
amplification event, such as PCR™ or LCR. 

For both the nuclease case and the partitioning case, an amplification event can be used to 
produce a significant amount of mass label. Mass labels attached to a probe hybridizing 
downstream from one of the PCR™ primers can be released during PCR™ amplification using 
the nick translation 5'-3 T exonuclease activity of the thermostable polymerase. Mass labels 
within primers can be released using a 5'-3' exonuclease such as T7 gene 6 exonuclease after 
amplification. In embodiments where a mass labeled primer is coupled to a biotinylated primer 
during amplification, or biotin is incorporated through the use of biotinylated nucleotides, and 
the product is partitioned away from the unincorporated primers, it is possible to use nonspecific 
cleavage, such as chemical cleavage methods, to the release of the mass label. 

In another embodiment, hybridization-specific nuclease digestion can also be used to 
cleave a probe containing both biotin and mass label, in an assay where solid-phase-bound 
steptavidin is used to remove uncleaved mass labels. Examples of such cleavage involve the use 
of a double-strand-specific nuclease such as those described above. Restriction endonucleases 
may be used to cleave a probe that contains a restriction site in the center and a mass label and 
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biotin at opposing ends of the probe. Another example, where RNA is used as a probe, involves 
double-strand-specific cleavage using RNase H. 



In another examplary method for the detection of an amplified single-stranded target such 
as that produced by T7 RNA polymerase transcription, a double-stranded probe is prepared with 
the mass label being attached to the strand that is homologous in sequence to the target strand. 
The mass-labeled strand is then displaced by a competitive hybridization with target and the 
mass label is released by a single-strand specific exonuclease such as exonuclease VII, Mung 
Bean nuclease or nuclease SI. An alternate method would employ the use of single-strand 
specific chemical cleavage reagent to release the mass label from a chemically modified probe. 
Examples of chemical modifications that would provide single-strand specific release of mass 
label include cleavage of a ribonucleotide base by transesteriflcation, a phosphoramidate 
cleavable by acid, and a 5' -P-S phosphorothioate cleavable by silver nitrate as described in 
Example 9. 

PCR™ can also be combined with the use of a mass labeled primer and a restriction 
enzyme to enable release of a mass label only if amplification occurs. In this embodiment the 
mass labeled PCR™ primer contains the sequence for a restriction site that becomes double- 
stranded only as part of the amplification process. Once the site is double stranded, it is 
recognized by the restriction enzyme and cleaved. The cleavage event releases the mass label 
from bulk of the primer and PCR™ product allowing it to be uniquely detected. 

An embodiment of the invention where mass-labeled probes can be used to measure 
mRNA levels in solution is shown schematically in FIG. 14. A series of gene-specific, mass- 
labeled probes (1-100 per study) are added to the mRNA pool (or more likely, first-strand 
cDNAs derived from the mRNA pool) and allowed to hybridize. Each gene-specific probe 
carries a unique mass label, and possibly multiple copies of that label to increase sensitivity. The 
hybridized mixture is treated with a double-strand-specific exonuclease that releases the mass 
labels for the portion of the probe population that was hybridized to target genes. Only if the 
mRNA from a gene of interest is present will the corresponding mass label be released and 
detected. In addition, the signal intensity for the particular mass label will be proportional to the 
relative abundance of the particular mRNA within the pool. Comparisons of the relative 
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intensities for the different mass labels reflect the relative mRNA expression levels. The relative 
gene expression pattern for as many as 38,400 genes could be probed for in a single 384 
microtiter plate if 100 different probes per well are used. Conversely, a set of 100 genes could be 
examined for 384 different samples in a single microtiter plate experiment. 

There are examples where the mass spectrometric sensitivity levels may be found to be 
insufficient to directly monitor the mRNA levels, e.g., due to small numbers of cells as a result 
of poor cell growth, or in animal model samples derived from very small tissue biopsies. For 
such samples, it may be necessary to incorporate message amplification schemes into the 
methodology. 

As described earlier, the use of nucleases that digest mass-labeled nucleic acid probes 
when they are hybridized to a target nucleic acid affords the possibility for linear amplification of 
signal. In cases where the target DNA is single stranded and significantly longer than the probe 
being used, it is possible to selectively digest only the probe. Digestion of the oligonucleotide 
probe makes the target strand repeatedly available for multiple rounds of hybridization and 
digestion. This type of amplification can readily achieve 2 to 3 orders magnitude of 
amplification. 

Because any given study may only monitor a relatively small number of genes, e.g., 20 to 
100, it may be possible to use one or a few multiplexed PCR™ reactions to amplify only the 
targets associated with the probe set. The use of PCR™ or other amplification methods may 
require the development of additional controls so as to reduce the influence of amplification 
artifacts. The multiplexing ability of mass-labeled probes makes it easy to include one or more 
controls. The use of redundant or semi-redundant primers, such as those used in differential 
display techniques, may also provide an effective amplification route. In all cases where a 
polymerase is used for amplification, such as Taq DNA polymerase, the 5* to 3' exonuclease 
activity can be used to digest the probe while amplification continues (Holland et al y 1991). 



All of the solution phase methods, including methods that utilize partitioning, described 
above may be utilized as a means for coupling the release of a mass label to the presence of a 
particular mRNA sequence. Other methods that may be used in amplification of the message 
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population include ligase chain reaction, in vitro transcription of the cDNA population, and 
variants of methods for producing cDNA libraries, such as single-well polyclonal cDNA plasmid 
growth. 



As the full gene set of an organism becomes available, it is conceivable to prepare 
beforehand the complete set of mass-labeled probes for gene expression analysis. With probes 
being enzymatically synthesized, a large stock of these probes can be made at a relatively 
inexpensive cost in less than a week of effort. It is also possible to quickly make a repertoire of 
mass-labeled probes through chemical means. 

While the preferred embodiment for the assays described herein is to use nonvolatile 
releasable mass labels or involatile releasable mass labels, other types of labels can be used as 
well, such as isotopic mass labels, volatile mass labels (including electrophores), fluorescent 
labels, and chemiluminescent labels. 

N. Multiplexed mass label substrates in affinity assays 

The methods disclosed herein may also be employed in indirect schemes for identifying 
the presence of one or more target biomolecules. Indirect schemes, such as enzyme-linked 
immunosorbent assays (ELISAs), provide a method for utilizing substrate conversion to a 
product molecule via enzymatic turnover of the substrate. Enzymatic catalysis of a substrate 
leads to the linear amplification of the product's signal. 

In an ELISA the target molecules, generally bound to the solid phase, are recognized by 
an antibody which noncovalently binds to the target. The recognition antibody is conjugated to 
an enzyme used to catalyze substrate conversion to product. Traditional ELISA techniques 
utilize small organic molecule substrates that when converted to product by an enzyme, such as 
alkaline phosphatase, horse-radish peroxidase, or urease, yield a molecule with changed optical 
qualities, e.g. the solution becomes colored or the product possesses strong fluorescence. In 
addition, the conversion of substrate to. product often produces a change in mass, thus the product 
may act as a mass label that may be detected by mass spectrometry. The amount of product may 
be quantified either absolutely or relative to the substrate used, with knowledge of enzyme 
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turnover rates and reaction conditions, and used to calculate the amount of a target molecule 
present in the assay. 

Methods for traditional ELISA assays are well established (see Current Protocols in 
Molecular Biology Vol. 2, Chapter 11, incorporated as a reference herein). Multiple protocols 
exist, which include indirect, direct competitive, antibody -sandwich, double antibody -sandwich, 
direct cellular, and indirect cellular assays. The mass label modification envisioned in this 
application would be designed to measure unknown quanties of target biomolecules by 
adaptation of the traditional ELISA methods. In this modification, target biomolecules are 
covalently or noncovalently bound to a surface, such as on a bead or a plastic dish, either directly 
or through a small "capturing" molecule (ligand) or a protein (such as an antibody). The target 
biomolecule could also be a component of a cell that could be bound to the surface of the vessel. 
The solid-phase target biomolecules are incubated with a target recognition molecule (antibody, 
ligand, oligonucleotide, etc.). that has a specific affinity for the target biomolecule. This target 
recognition molecule is conjugated to an enzyme. For multiplexed assays each target 
recognition molecule must be covalently linked to an enzyme with a unique catalytic activity for 
differentiation of the different targets (typical of the "direct" assay protocols). These conjugated 
target recognition molecules are allowed to bind to the substrate; unbound molecules are 
removed by washing, then the enzyme substrates are added under conditions in which bound 
enzyme reacts with its substrate to release a product with a unique mass that is detectable using 
mass spectrometry. 

"Capture antibodies" with high specific binding affinity for the antigens may be needed 
for soluble antigens. Methods for preparation of specific antibodies for either capture or 
quantitation of antigens are well established in the literature. Methods for conjugating enzymes 
to antibodies are also well established and may include crosslinking agents such as 
glutaraldehyde or conjugation via perioxidate oxidation. Purified DNA restriction enzymes are 
commercially available. New enzymes with unique catalytic activity may also be engineered 
using established molecular procedures. 

The ease of detection of a multiplex of mass labels offers the opportunity for the 
performance of a multiplex of immuno assays simultaneously within a single solution. Different 
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enzymes, conjugated to antibodies or other target recognition molecules, used in combination 
with a set of enzyme-specific substrates may be used to yield enzymatic products that are unique 
in mass and therefore uniquely detectable and quantitatable by mass spectrometry. 

In addition to multiplexing an unrelated set of enzymes and substrates, classes of 
enzymes that modify a class of substrates may also be multiplexed. For example, classes of 
enzymes all recognizing the same substrate but modifying it in different ways may be employed 
as may enzymes which recognize and modify particular chemically-related substrates, where the 
variations in structure alter the specificity of particular enzymes for the particular substrate. 

A class .of .enzymes al! -recognizing the same or a few substrates is proteases. Proteases 
recognize different amino acids or amino acid sequence motifs and cleave the amide linkage 
yielding two or more fragments. Examples of proteases and their specificities include: trypsin, 
which cleaves at the C-terminaJ side of both arginine and lysine residues; thrombin, which 
cleaves at arginine; Glu-C, which cleaves at the C-terminal side of glutamic acid residues; Lys- 
C, which cleaves at the C-terminal side of lysines; and Asp-N, which cleaves at the N-terminal 
side of aspartic acid residues. Small polypeptides containing specific amino acids and/or amino 
acid sequence motifs may be used as substrates for proteolytic digestion. The use of one or a few 
polypeptides that are recognized and cleaved differently by different proteases sets up a situation 
where there is a competition for substrate. The use of competitive substrates, and measurements 
of the relative ratios of different products derived from the same substrate, may provide a more 
accurate measure of the relative quantities of different target biomolecules. 

One potential problem with the use of proteases is their possible digestion of antibodies 
and other proteins required for the bioassay. This problem may be overcome through a variety of 
means including, careful selection of proteases, selective chemical modification to block 
proteolysis, and use of protease inhibitors including those that can be competitively displaced by 
the reaction substrates. Alternatively, proteases may be used on other nonprotein-based assays 
such as probing for nucleic acid using oligonucleotide probes conjugated to the proteases. Other 
classes of enzymes that may be used instead of proteases include kinases which phophorylate 
their substrates and nucleases. 



WO 98/26095 ^ PCT/US97/22639 

Ribonucleases and deoxyribonucl eases have varying specificity. Endonucleases such as 
RNase Tl, Rnase U2, and Rnase CL3 , target G, A, and C nucleotides, respectively. In a similar 
manner to the use of small polypeptides as substrate for proteases, small oligonucleotides may be 
used with nucleases. Nuclease resistant nucleotides, such as phosphorothioates, 
methylphosphonates, boranophosphates, and peptide nucleic acids can be incorporated into the 
substrates to direct the specificity of the different nucleases toward yielding unique products. 
Unlike peptides which can be simply and easily detected by mass spectrometry it may be 
prefered to modify the oligonucleotides with the addition of polypeptides or other molecules to 
improve and ease analysis in the mass spectrometer. 

Another class of enzymes is restriction endonucleases. Use of restriction enzymes falls 
under the second case described above, where substrates may be chemically related but 
variations in structure alter their specificity as far as to which enzyme in the class will recognize 
and modify it. In this case the structural alterations are changes in the sequence of the substrates. 
The substrates themselves are small double-stranded oligonucleotides which contain one or more 
restriction endonuclease recognition and cleavage sites. Similar to the use of nucleases described 
above, and as is described in other sections of this invention, it is prefered to modify the 
oligonucleotides with the addition of polypeptides or other molecules to improve and ease 
analysis and selectivity in the mass spectrometer. Because many restriction endonucleases 
recognize palindromic sequences it is also possible to increase the level of signal two-fold by the 
use of palindromic oligonucleotide substrates which form dimers. Each cleavage event forms 
two identical products. Longer concatamers may also be produced creating larger, multi-mass- 
labeled substrate. 

Antibodies are not the only possible target-recognition molecule that may be used in 
these assays. Polypeptides derived from methods such as phage display with target binding 
properties, as well as a variety of native proteins that demonstrate some binding activity of 
interest, may be used instead. Targets may also be something other than proteins and can include 
a variety of biologically relevant small molecules, including enzyme cofactors, hormones, 
neurotransmitters, and other biopolymers including polysaccharides and most importantly 
nucleic acids. Nucleic acid hybridization interactions may be used where both the target and the 
recognition molecule are comprised of nucleic acids. Nucleic acids and other nonpeptide 
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recognition molecules may. be bound to the enzyme involved in substrate conversion covalently 
via a variety of linkage chemistries, some of which have been described here in the XXX section, 
or noncovalently through a biotin/avidin linkage where the avidin is conjugated to the substrate 
conversion enzyme. One skilled in the art can identify other linking methods. 

The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill in the art that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventors to function well in the 
practice of the invention, and thus can be considered to constitute preferred modes for its 
practice. However, those of skill in the art should, in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and -still obtain 
a like or similar result without departing from the spirit and scope of the invention. 

EXAMPLE 1 
Synthesis of Peptide-Labeled Oligonucleotides 

A. Preparation of Peptide-Linked Nucleoside 5' Triphosphates 

Preparation of pepti de-linked nucleoside 5' -triphosphates involves synthesis and coupling 
of allylamino-substituted dNTPs. An example is shown in FIG. 6A. 5-(3-aminoallyl)-2'- 
deoxyuridine 5 '-triphosphate (c) was prepared according to the procedure of Langer et al (1981). 
Treatment of dUTP (a) with mercuric acetate at pH 5-7 provides the 5-mercurated derivative (b), 
Allylation in the presence of a palladium catalyst then provided c, which was coupled to the 
NHS-ester (d) of a suitably protected peptide (lysine and N-terminal amines blocked with FMOC 
groups). Base deprotection of the peptide resulted in formation of the desired product (e). 
Alternatively, the allylamino-nucleotide (c) was treated sequentially with the hetero-bifunctional 
crossiinking reagent mal-sac-HNSA (Bachem Bioscience Inc., King of Prussia, PA) and an N- 
terminal cysteine peptide to give the conjugate (f). 

B. Preparation of Peptide-Labeled Phosphoramidites 

Peptide nucleoside phosphoramidite conjugates were prepared from 5'-protected 
allylaminonucleosides as shown in FIG. 6B. Selective dimethoxytritylation of uridine (h) 
provided the 5'-DMT ether (i), that was allylated via the mercurinucleoside with palladium 
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catalyst (Dale etai, 1973; Langer et al y 1981). Treatment of the NHS-ester of a suitably 
protected peptide and conversion of the conjugate to the phosphoramidite (Sproat etaL, 1987) 
provided the desired compound (k). 

C. Synthesis of a 5' Labeled Oligonucleotide-Peptide Conjugate 

Oligonucleotide g (FIG. 6C) (SEQ ID NO: 10) was prepared using standard solid-phase 
phosphoramidite chemistry. The S'-amino-modification through a disulfide linkage was 
achieved by sequential addition of Thio-Modifier C6 S-S and Ammo-Modifier C6 df (Glen 
Research Inc., Sterling, VA) to the 5'-end. The oligonucleotide was coupled to the 
heterobifunctonal reagent mai-sac-HNSA (Bach em California Inc., Torrance, CA*) thrGiish the 
terminal primary amino group, purified by exclusion chromatography, and covalently coupled to 
a peptide with the sequence CGR GSG K through the N-terminal cysteine thiol. The conjugate 
was purified by ion-exchange chromatography, and analyzed by MALDI-TOF mass 
spectrometry (FIG 7X). The peak at m/z 8401 in FIG 7X corresponds to the desired conjugate. 

D. Synthesis of a 3' Labeled Oligonucleotide 

A 3' phosphorylated oligonucleotide with the sequence 
5'-TGAGGTGCGTGTTTGTGCCTGTp-3' (SEQ ID NO: 1) was synthesized by standard 
phosphoramidite chemistry. A MALDI mass spectrum of the unconjugated oligonucleotide is 
shown in FIG. 7A. The 3'-terminal T residue of the oligonucleotide was modified with a primary 
amino-group that was incorporated during the synthesis as the modified phosphoramidite (C6- 
amirio modifier, Glen Research Inc., Sterling, VA). The oligonucleotide was coupled through 
the active amino group to a peptide using the hetero-bifunctional coupling reagent mal-sac- 
HNSA (Bachem Inc., Torrance, CA). The sequence of the peptide used for coupling to the 
oligonucleotide was CGYGPKKKRKVGG (SEQ ID NO: 2) (Sigma Chemical Co., St. Louis, 
MO). The reaction to couple the peptide to the oligonucleotide occurs at the reactive thiol group 
on the N-terminal cysteine residue. After the coupling reaction, which is carried out according to 
standard procedure, the crude coupled product is purified by reversed phase HPLC. Fractions 
containing the desired coupled product were identified by MALDI-MS, and were combined and 
evaporated to dryness. The dried material was dissolved in a small amount of water and the 
concentration determined by UV absorbance at 260 nm. A MALDI mass spectrum of the 
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oligonucleotide-peptide conjugate is shown in FIG. 7B. The major peak at m/z 8622.8 agrees 
well with desired product, while the peak at 7051.7 is due to a residual amount of unreacted 
oligonucleotide (ca. 20%). 



E. Synthesis of an Internally-Labeled Oligonucleotide-Peptide Conjugate 

An oligonucleotide of the sequence 5'-GGT TTA CAT GTT CCA A(aminoT)A TGA 
T-3' (SEQ ID NO: 11) was prepared by standard phosphoramidite chemistry using Amino- 
Modifier C6 dT (Glen Research Inc., Sterling, VA) to incorporate the internal amino- 
modification. The oligonucleotide was coupled to the hetrobifunctionai reagent mal-sac-HNSA 
(Bachem Califomai Inc., Torrance, CA) through the internal primary amino group, purified by 
exclusion chromatography, and covalently coupled to a peptide with the sequence CGT RGS 
GKG TG through the N-terminal cysteine thiol. The conjugate was purified by ion-exchange 
chromatography, and analyzed by MALDI-TOF mass spectrometry (FIG 7X). The peak at m/z 
8075 in FIG 7X corresponds to the desired conjugate. 



EXAMPLE 2 
Detection of a Specific Target Sequence 

As an example of the utility of the oligonucleotide-peptide conjugate as a probe in a 
hybridization study, a model system was designed using a synthetic complementary strand as 
target DNA. A 42-mer was synthesized as a model target, with the sequence 
5'-CTCCCAGGACAGGCACAAACACGCACCTCAAAGCTGTTCCGT-3' (SEQ ID NO:3). 
Detection of the target was based on release of the peptide mass label (SEQ ID NO: 2) from the 
probe by a digestion with the 3'-5' double-strand-specific exonuclease III with analysis by 
MALDI-MS. 

A mixture of 1 pmol of probe and 1 pmol of target in a 9 jaL volume of IX Exonuclease 
III buffer (66mM Tris-HCl, pH 8.0; 5mM DTT; 6.6mM MgCh; 50. jag/mL BSA) was allowed to 
anneal by heating the solution for 2 minutes in a boiling water bath and then slowly cooling it to 
room temperature over the course of about 20 minutes. Exonuclease III (USB, Cleveland, OH) 
was diluted from its stock concentration of 17.5 U/^iL to 0.35 U/jiL in IX buffer, and a 1 
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aliquot was added to the annealed target-probe solution. Four controls were included and run 
simultaneously with the test solution. Control sample A contained both target and probe but no 
exonuclease III, control sample B contained probe and Exonuclease III but no target, control 
sample C contained probe and Exonuclease III together with a "random non-complementary 36- 
mer, and control sample D contained only Exonuclease III. The mixtures were allowed to 
incubate for 30 minutes at room temperature. A 1 jj.L aliquot of the solution was removed and 
added on top of a polycrystalline spot of 2,5-dihydroxybenzoic acid on a MALDI-MS sample 
plate. The resulting positive-ion mass spectra of the test and control samples A, B and C are 
shown in FIG. 8A, FIG. 8B, FIG. 8C, and FIG 8D. Only the test sample in FIG. 8A showed a 
peak at 2045.3, the mass expected for the released peptide-nucleotide conjugate, demonstrating 
that in this model system the inventors were able to specifically detect the presence of the target 
sequence by a sensitive and rapid method. 

Selective Enzymatic Cleavage of a Peptide. Oxidized bovine insulin chain B (Sigma 
Chemical Company, St. Louis, MO) in Tris»HCI (pH=7.8) was treated with Endoproteinase Glu- 
C (w/w ratio 20:1, Sigma Chemical Company, St. Louis, MO) at 37 °C for 2 hours, and 
examined by MALDI-TOF mass spectrometry. The analysis (FIG XX) indicated that the insulin 
(SEQ ID NO: 12) was efficiently cleaved at the carboxyl side of glutamyl residues into three 
fragments, m/z 1533 (FVNQHLC[S0 3 H]GSHLVE) (SEQ ID NO: 13), m/z 1089 
(RGFFYTPKA) (SEQ ID NO: 14), and m/z 919 (ALYLVC[S0 3 H]GE) (SEQ ID NO: 15). The 
relative intensities of the three peaks in the mass spectrum reflect the number of basic (ionizable) 
functionalities in the three fragments. The largest molecular weight fragment contains two 
moderately basic histidine residues and is therefore only modestly visible in the spectrum. The 
middle fragment contains strongly basic lysine and arginine residues and therefore displays an 
intense peak. The smallest fragment has only the terminal amino-group available for 
protonation, and is therefore barely detectable in the spectrum. 

EXAMPLE 3 

Detection of mRNA using Mass-Labeled Primers and rtPCR™ 

A pair of PCR™ primers for the ribosomal protein L7 gene was synthesized by standard 
phosphoramidite chemistry with a modified amino -thymidine (Glen Research, Sterlin,VA) 
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incorporated near the 3'-end of each. The sequence of the forward primer was 5'- 
ATCTGAAGTCAGTAAAT*GAAC-3' (SEQ ID NO:4) and the sequence for the reverse primer 
was 5'- ATTT ACC AG AG AT* CG AG-3 ' (SEQ ID. NO:5), where T* represents the amino- 
modified thymidine. Each primer was mass-labeled with a unique peptide by a standard 
coupling reaction between the amino group of the amino-modified thymidine and a sulfhydryl 
group on the peptide through the heterobifunctional linker mal-SAC-HNSA (Bachem Corp., 
Torrance CA), and purified by ion-exchange HPLC. The peptide mass label used for the forward 
primer had the sequence CGYGPKKKRKVGG (SEQ ID NO:2), and for the reverse primer the 
peptide was CKNLNKDKQVYRATHR (SEQ ID NO:6). 

A reverse transcription reaction was performed on 10 pg of total RNA isolated. from a 
stable cancer cell line to generate first strand cDNA. The reaction was performed in a total 
volume of 20 and contained 0.5 mg of oligo dT, 5 primer (SEQ ID NO:9) and 25 units of 
AMV reverse transcriptase. A PCR™ reaction was performed on 1 jil of the first strand cDNA 
using 10 pmol each of the forward and reverse mass-labeled primers and 0.25 units of Taq DNA 
polymerase in a 10 jal reaction. The rtPCR™ product was purified through a Microcon-30 
ultrafiltration unit (Amicon, Inc., Beverly, MA) according to the manufacturer's directions. After 
collecting the DNA from the filter unit, it was evaporated to dryness in a vacuum centrifuge and 
resuspended in 3.5 jal H 2 0. 

A digestion reaction using the double-strand specific 5'-3* exonuclease of T7 gene 6 was 
then performed. To the' 3.5 ^1 of purified PCR™ product was added 0:5 jil of 10X buffer (660 
mM Tris, pH 8, 6.6mM MgCl 2 ) followed by 1 nl (5 units) of T7 gene 6 exonuclease (Amersham * 
Inc.). A control digestion was performed at the same time and contained 5 units of enzyme, 5 
pmol of free forward primer in an identical buffer. The digestion reactions were allowed to 
incubate at 37°C for 60 minutes follwed by a heat inactivation of the enzyme (85°C for 15 
minutes). A small portion of anion exchange resin (DEAE Sephadex A-25, Aldrich Chemical 
Co., Milwaukee, WI) was added to each digestion and a 1 fil portion of the supernatant was 
removed and analysed by MALDI-TOF mass spectrometry (positive ions, 2,5-dihydroxy benzoic 
acid matrix). The resulting mass spectra of the digested PCR™ product and control are shown in 
FIG. 15A and FIG. 15B respectively. 
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EXAMPLE 4 
Detection of a Mixture of cDNA Plasmids 
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A mixture of 100 ng each of six and 50 ng of a seventh single-strand Ml 3 plasmid 
clones, each containing unique inserts, was desalted and concentrated in a Microcon-30 
ultrafiltration unit according to the manufacturer's directions. The DNA, after collection, was 
evaporated to dryness and resuspended in 1 ul of H 2 G. A mixture of seven mass-labeled probes 
containing 2.5 pmol each was added. Each probe was complementary to a portion of the insert 
for each clone in the mixture and was coupled to a unique peptide mass label. The probes were 
allowed to hybridize by heating the mixture to 95°C for 30 seconds followed by a 1 minute 
incubation at 45°C. After cooling the mixture to 37°C ; 0,3 5 units of Exonuclease III -was added 
and the digestion was allowed to proceed for 60 minutes. The reaction was allowed to cool to 
room temperature and then a small portion of DEAE Sephadex A25 anion exchange resin 
(Aldrich Chemical Co., Milwaukee, WI) was added. A 1 |il portion of the supernatant was then 
removed and analysed by MALDI-TOF mass spectrometry (positive ions, 2,5-dihydroxy benzoic 
acid matrix). The resulting mass spectrum of the mixture of released mass labels is shown in 
FIG. 16. 

EXAMPLE 5 

SNP Analysis with Mass-labeled Primers and Biotinvlated Dideoxvucleoside Triphosphates 

A primer ("Primer A") containing a chemically-releasable mass label is synthesized and 
purified according to the method described in Example 1C. Two synthetic template strands are 
also synthesized by standard solid phase synthesis techniques. The sequence of Primer A is 5'- 
LTSS- GTGCTCAAGAACTACATGG -3' (SEQ ID NO: 16) and the sequences for the template 
strands are 5 '-TACTCCAGTTCCATGTAGTTCTTGAGC AC-3 ' (Template IT) (SEQ ID NO: 

17) and 5 '-TACTCCAGTACC ATGTAGTTCTTGAGC AC-3 ' (Template 1A) (SEQ ID NO: 

18) , where LT indicates the mass label attached to an amino-modified thymidine, SS represents 
the chemically cleavable disulfide-containing group, and the boldface base designations in the 
template strands indicate the polymorphic sites adjacent to the 3'-end of the primer. The primer 
is mass-labeled with a synthetic peptide possessing the sequence CGRGSGK (SEQ ID NO: 19). 
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Two cycle-sequencing reactions are performed. Each reaction contains 2 pmol of mass-labeled 
Primer A, 100 fmol of either Template IT or Template 1A, 200 pmol of Biotin-ddUTP 
(Boehringer-Mannheim, Inc.) and 2.4 units of the thermostable DNA polymerase AmpliTaq-FS 
(Perkin-Elmer Inc.) in a total volume of 20 uL. Both reactions are begun using typical hot-start 
conditions. The reactions are performed according to the following thermal cycling program: 
denaturing at 90 °C for 30 s, annealing at 50 °C for 10 s, extension at 65 °C for 10 s, for a total of 
35 cycles. Upon completion, the sequencing reacions are purified by capturing the extended 
biotinylated products on streptavidin-coated magnetic beads. The beads are washed to remove 
unextended primer and then the mass label released by treatment of the bead-bound product with 
a mild reducing agent to cleave the disulfide bond and release the mass label into solution. A 1 
uL portion of the supernatant is removed and analysed by MALDI-TOF mass spectrometry 
(positive ions, 2,5-dihydroxy benzoic acid matrix). The resulting mass spectra of the reaction 
containing the correct template to extend with biotin-ddUTP and of the reaction containing the 
incorrect template are shown in FIG. 17A and FIG.17B, respectively. Since signal can only be 
seen in the spectrum in FIG. 17A as expected for the proper nucleotide incorporation, these 
results demonstrate the possibility of performing an SNP analysis using a mass-labeled primer 
together with biotinylated dideoxynucleoside triphosphates. 

EXAMPLE 6 

Multiplexed SNP Analysis with Mass-labeled Primers and Biotinylated Dideoxynucleoside 

Triphosphates 

Two primers ("Primer B" and "Primer C") each containing a unique chemically- 
releasable mass label are synthesized and purified according to the method described in Example 
1C. A synthetic template strand for each is also synthesized by standard solid phase synthesis 
techniques. The sequence of the Primer B is 5 '-LTSS-TCGGAGTC AACGG ATTTG -3' (SEQ 
ID NO: 20) and the sequence for the corresponding template strand is 5'- 
TCCAGTTCTCAAATCCGTTGACTCCGA -3» ("Template 2T") (SEQ ID NO: 21). Primer C 
and its template strand ("Template 3T") have the sequences 5'-LTSS- 
GATGTCTGTATATGTTGCACTG -3' (SEQ ID NO: 22) and 5*- 
AAGTTGACTCTCAGTGCAACATATACAGACATC-3' (SEQ ID NO: 23), respectively, 
where LT, SS, and boldface have the same meanings as described in Example 5. Primer B is 
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mass-labeled with the synthetic peptide CAGGRGGGKGGA (SEQ ID NO: 24) and Primer C 
with the synthetic peptide CASGRGSGKGSA (SEQ ID NO: 25). 

A multiplexed cycle-sequencing reaction is performed with Primer A, Primer B, Primer C 
and each of the corresponding templates. The reaction contains 2 pmol of each mass-labeled 
primer, 100 fmol each of Template IT, Template 2T and Template 3T, 200 pmol of Biotin- 
ddATP (Clonetech, Inc.) and 2.4 units of the thermostable DNA polymerase AmpliTaq-FS 
(Perkin-Elmer Inc.) in a total volume of 20 jiL. The reaction is begun using typical hot-start 
conditions and is performed according to the following thermal cycling program: denaturing at 
90 °C for 30 s, annealing at 50 °C for 10 s, extension at 65 °C for 10 s, for a total of 35 cycles. 
Upon completion, the sequencing reaction is purified by capturing the extended biotinylated 
products on streptavidin-coated magnetic beads. The beads are washed to remove unextended 
primer and then the mass labels released by treatment of the bead-bound products with a mild 
reducing agent to cleave the disulfide bonds and release the mass labels into solution. A 1 u,L 
portion of the supernatant is removed and analysed by MALDI-TOF mass spectrometry (positive 
ions, 2,5-dihydroxy benzoic acid matrix). The resulting mass spectrum showing signals for each 
of the expected mass-labels with peaks labeled as A, B and C referring to primers A, B, and C 
respectively is shown in FIG. 18. This demonstrates the potential for performing multiplex SNP 
analyses utilizing mass-labeled primers. 

EXAMPLE 7 

SNP Analysis with Mass-labeled Primers and Biotinylated Nucleoside 
Triphosphates plus Normal Dideoxvnucleoside Triphosphates. 

. Two cycle-sequencing reactions are performed with primer A and one of either template 
IT (SEQ ID NO: 17) or template 1 A (SEQ ID NO: 18). Each reaction contains 2 pmol of mass- 
labeled primer and 100 fmol of template. The triphosphates in each reaction consist of 200 pmol 
each of Biotin-dCTP (Clonetech, Inc.), dATP and ddTTP. The reactions are performed with 2.4 
units of the thermostable DNA polymerase AmpliTaq-FS (Perkin-Elmer Inc.) in a total volume 
of 20 mL. - The reactions are begun using typical hot-start conditions and are performed 
according to the following thermal cycling program: denaturing at 90 °C for 30 s, annealing at 
50 °C for 10 s, extension at 65 °C for 10 s, for a total of 35 cycles. Upon completion, the 
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sequencing reactions are purified by capturing the extended biotinylated products on 
streptavidin-coated magnetic beads. The beads are washed to remove unextended primer and 
then the mass labels released by treatment of the bead-bound products with a mild reducing agent 
to cleave the disulfide bonds and release the mass labels into solution. A 1 mL portion of each 
supernatant is removed and analysed by MALDI-TOF mass spectrometry (positive ions, 2,5- 
dihydroxy benzoic acid matrix). The resulting mass spectra for the reaction containing template 
IT and the reaction containing template 1A are shown in FIG. 19A [XX3a] and FIG. 19B 
[XX3bJ respectively. 



EXAMPLE 8 

Mass label Tagging of Degenerate Base.Rrimers and the Identification of Sequence 
Variants by Extension with Biotinylated Dideoxvnucleoside Triphosphates 

Two primers related to Primer A and differing only in the identity of the 3'-terminal base 
are synthesized and mass-labeled according to the method described in Example 1C. The 
sequence of Primer D is 5'-LTSS- GTGCTCAAGAACTACATGA -3' (SEQ ID NO: 26) and the 
sequence of Primer E is 5*-LTSS- GTGCTCAAGAACTACATGT -3' (SEQ ID NO: 27), where 
LT and SS have the meanings described in Example 5. A synthetic template strand ("Template 
4A") is also synthesized using standard solid phase synthesis techniques. The sequence of the 
template strand is 5 '-TACTCC AGTTAC ATGTAGTTCTTGAGC AC-3 * (SEQ ID NO: 28), 
where the boldface indicates the base that varies from Template IT. Primers D and E are mass- 
labeled with two unique synthetic peptide that differ from the peptide attached to Primer A. The 
peptide attached to Primer D is CAGGRGGGKGGA (SEQ ID NO: 29), while the peptide 
attached to primerE is CASGRGSGKGSA (SEQ ID NO: 30). 

Two cycle-sequencing reactions are performed. Each reaction contains 2 pmol each of 
mass-labeled Primer A, Primer D, and Primer E, 100 fmol of either Template IT or Template 
4A, 200 pmol of Biotin-ddATP (Clonetech, Inc.) and 2.4 units of the thermostable DNA 
polymerase ArnpliTaq-FS (Perkin-Elmer Inc.) in a total volume of 20 jiL. Both reactions are 
begun using- typical hot-start conditions. The reactions are performed according to the following 
thermal cycling program: denaturing at 90 °C for 30 s, annealing at 60 °C for 10 s, extension at 
65 °C for 10 s, for a total of 35 cycles. Upon completion, the sequencing reacions are purified by 
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capturing the extended biotinylated products on streptavidin-coated magnetic beads. The beads 
are washed to remove unextended primer and then the mass label released by treatment of the 
bead-bound product with a mild reducing agent to cleave the disulfide bond and release the mass 
label into solution. A 1 uL portion of the supernatant is removed and analysed by MALDI-TOF 
mass spectrometry (positive. ions, 2,5-dihydroxy benzoic acid matrix). The resulting mass spectra 
for the Primer E matched template and for the Primer A matched template are shown in FIG. 
20A and FIG. 20B, respectively. When primer E is perfectly matched to the template, the 
predominant mass label signal seen in the mass spectrum is that from primer E. Likewise when 
primer A is perfectly matched to the template in the reaction, the predominant mass label signal 
seen in the mass spectrum is from primer A. This example demonstrates the potential utility of 
using a mixture of degenerate, uniquely mass-labeled primers to determine a variable sequence 
that is adjacent to a fixed sequence. 

EXAMPLE 9 

Single-Strand Selective Chemical Release of Mass Label 

A chemically-cleavable oligonucleotide probe (SEQ ID NO: 31) containing a bridging 5'- 
S-P phosphodiester linkage in the backbone is synthesized by standard solid phase synthesis 
techniques incorporating a modified phosphoramidite reagent at the site of cleavage as described 
in PCT Patent Application WO 96/37630. The sequence of the 25-mer probe is 5'- 
CCTGGC AAACTC AACTAGGC(sT)GTCC-3 ' (SEQ ID NO: 31), where sT indicates the 
cleavage site. A complementary 35-mer oligonucleotide with the sequence 5'- 
GATCCGGACAGCCTAGTTGAGTTTGC-CAGGTAAGA-3* (SEQ ID NO: 32) is likewise 
synthesized. 

The probe and complement are hybridized together to form a duplex DNA in 1M 
triethylammonium acetate buffer by heating a mixture of 10 prnol each at 95 °C for 3 min 
followed by a 10 min incubation at 70 °C and a subsequent 50 °C 10 min incubation. The 
mixture is allowed to come to room temperature and AgN0 3 is added to a final concentration of 
0.14 raM. -The silver promoted cleavage reaction is allowed to proceed for 60 min at room 
temperature (20 °C) after which the reaction is quenched by the addition of excess dithiothreitol. 
After evaporation of the sample, 3-HPA MALDI matrix solution is added to redissolve the DNA. 
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The solution is spotted onto the mass spectrometer sample plate and analyzed. The resulting 
mass spectrum and a mass spectrum of a no-complement control cleavage are shown in FIG. 21 A 
and FIG. 2 IB, respectively. The spectrum of the control reaction shows that under the 
conditions used, the single-stranded oligonucleotide goes to about 90% complete cleavage, while 
the spectrum of the double-stranded form shows that under identical conditions not more than 
about 5% cleavage occurs. This demonstrates the potential use a chemical cleavage reagent to 
diminiscrate between hybridized and unhybridized probes for release of mass label. 

EXAMPLE 10 

Release of Mass Label by Exonuclease III Digestion of DNA Probe Hybridized to an RNA 



A pair of PCR primers for the ribosomal protein L7 gene is synthesized by standard 
phosphorarnidite chemistry. The forward primer contained at the 5' -end an extension which is 
the promoter region of T7 RNA polymerase. The sequence of the forward primer is 5'- 
TAATACGACTCACTATAGGGAGACTGCTGAGGATTGTA-GAGC-3' (SEQ ID NO: 33) 
and the sequence for the reverse primer is 5'-TCCAACAGTATAGATCTCATG-3' (SEQ ID 
NO: 34). A pair of probes is also synthesized, each containing unique mass labels. The probes 
are designed such that each hybridizes to a different strand of the PCR product while only one of 
them hybridizes to a strand of transcribed RNA. The peptide mass label used for the upper- 
strand probe had the sequence CGYGPKKKRKVGG (SEQ ID NO: 35), and for the lower-strand 
(RNA-specific) probe the peptide was CKNLNKDKQVYRATHRB (SEQ ID NO: 36). The 
synthesis of the mass-labeled probes is described in Example 1 E. 

A reverse transcription reaction was performed on 10 jig of total RNA isolated from a 
stable cancer cell line to generate first strand cDNA. The reaction was performed in a total 
volume of 20 uJL and contained 0.5^g of oligo dTO primer and 25 units of AMV reverse 
transcriptase. A PCR reaction was performed on 1 \xL of the first strand cDNA using 10 pmol 
each of the T7-forward and reverse primers and 1 unit of Taq DNA polymerase in a 20 \xL 
reaction. 
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A two microliter aliquot of the RT-PCR product is then used for a 20 microliter 
transcription reaction which contains 100 units of T7 RNA polymerase, 20 units of RNAsin 
inhibitor and 1 mM concentration of each rNTP. The transcription reaction is allowed to 
proceed at 37 °C for 2 h. One microliter of the transcription reaction product is then probed 
using 5 pmol each of the two strand specific probes above. As a control, one microliter of the 
RT-PCR product is used instead of the transcription reaction product. The probes and targets are 
hybridized in IX exonuclease III buffer by heating the mixture to 95 °C for 3 min, then 
incubating at 65 °C for 1 min then cooling to 37 °C. Exonuclease III is then added to the mixture 
and the digestion is allowed to proceed at 37 °C for 1 h. A 1 uL portion of the supernatant was 
removed and analysed by MALDI-TOF mass spectrometry (positive ions, 2,5-dihydroxy benzoic 
acid matrix). The resulting mass spectra of the digested RNA transcription product and control 
are shown in FIG. 22A and FIG. 22B respectively. Only the RNA-strand specific probe mass 
label signal is seen in the transcription reaction sample while both probe mass label signals are 
seen when the RT-PCR product is probed. The fact that only the RNA-strand specific probe 
produces a signal in the mass spectrum when RNA transcript is present, together with the fact 
that signals from both probes should be seen if the signal were resulting only from residual RT- 
PCR product, shows that the enzyme exonuclease III can be used to specifically digest a probe 
hybridized to an RNA transcript to release a mass label. 

EXAMPLE 11 
Matrix Selectivity for Peptide Mass Label or DNA 

A 2 pmol portion of each of the mass-labeled primers Primer A and Primer C is treated 
with a mild reducing agent to cleave the molecule at the disulfide bond to yield separate peptide 
and DNA fragments. For each primer, a 1 microliter portion is spotted onto the mass 
spectrometer sample plate with the matrix 2,5-dihydroxybenzoic acid, and a second 1 microliter 
portion is spotted with the matrix 3 -HP A. The mass spectrum for Primer C obtained with 2,5- 
dihydroxybenzoic acid is shown in FIG. 23A and shows a strong peptide signal with only very 
weak, poorly resolved signal at the expected mass of the DNA fragment. In contrast, the mass 
spectrum obtained with 3-HPA (FIG. 23B) shows a strong, sharp signal for the DNA fragment 
and a weaker signal for the peptide fragment. The corresponding spectra obtained for primer A 
are shown in FIG. 23C (2,5-DHB) and FIG. 23D (3-HPA). These results demonstrate that it is 
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possible to selectively detect a released mass-labeled section of a probe in the presence of the 
much larger portion of the probe not carrying a mass label 

EXAMPLE 12 

Detection of a specific biomolecule (T) in a restriction enzyme-linked immunoadsorbent 

assay 

As an example of the detection of a target biomolecule via release of mass labels, a model 
system based on ELISA technology was designed. This assay incorporates a DNA restriction 
enzyme for the digestion of a mass-labeled substrate that is ultimately detected by mass 
spectrometry. This example describes a antibody-sandwich ELISA to detect soluble antigens. 
ELISA are described in Ausubei et al (1997), incorporated herein by reference. Synthesis of the 
probe (mass label bound to double-strand oligonucleotide containing an EcoRI restriction site) is 
described in Example 1 . Double-stranded probe is prepared by hybridization of complementary 
oligonucleotides. Standard solutions of antigen T are prepared for calibration of the assay (1 - 
1000 ng/mL, depending on the linear range of the assay). Specific capture antibodies (Anti-T) 
and and a target recognition molecule crosslinked to the restriction enzyme EcoRI (Anti T- 
EcoRI) are also prepared (0.1 units of EcoRI per ng of specific antibody; 10 units per mL). 

Procedure 

1 . Coat wells of microwell dishes (Immulon or equivalent) with the capture antibody (10 
ug/mL) which then is bound overnight according to the manufacturer's instructions. Block the 
residual binding capacity of the plate with blocking buffer ( a buffered solution of 0.05% Tween 
20 and 0.25% bovine serum albumin) by filling wells with the solution and incubating 30 min at 
room temperature. Rinse plates with water threes times and remove residual water. 

2. Bind solutions of known and unknown amounts of antigen T (in blocking buffer) to 
the wells, 50 uL/well and incubate at least 2 h. Wash plate three times with water, then treat 
with blocking buffer for 10 min. Rinse again with water three times. 



\ 
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3. Add 50 u.L of Anti T-EcoRI (containing 0.5 unit of EcoRI activity) to each well and 
incubate 2 h at room temperature. Wash plate 3 times using IX EcoRI buffer containing 0.25% 
BSA. 



4. For each 96-weIl dish, mix 

140 |iL Double-strand probe (10 pmol of mass-labeled oligonucleotide, 7 uM 
stock) 

1 00 uL EcoRI buffer (1 OX) 
760 uL H20 

5. Add I G uL of the above mix to each well; incubate at 37° C for the appropriate time to 
obtain a linear response with concentration of T (up to 1 h). Heat inactivate enzyme at 65 °C for 
20 min then cool to 4 °C. Spot 1 ul of the mixture with DHB, wash dried spots 2X with 2 uL of 
H20, and analyze for the released mass label. 



All of the compositions and methods disclosed and claimed herein can be made and 
executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied to the 
compositions and methods and in the steps or in the sequence of steps of the method described 
herein without departing from the concept, spirit and scope of the invention. More specifically, 
it will be apparent that certain agents which are both chemically and physiologically related may 
be substituted for the agents described herein while the same or similar results would be 
achieved. All such similar substitutes and modifications apparent to those skilled in the art are 
deemed to be within the spirit, scope and concept of the invention as defined by the appended 
claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: GENE TRACE SYSTEMS , INC. 

(B) STREET: 333 Ravens wood Avenue, PN 083 

(C) CITY: Menlo Park 

(D) STATE: California 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 94 02 5 

(ii) TITLE OF INVENTION: RELEASABLE NONVOLATILE MASS -LABEL MOLECULES 
(iii) NUMBER OF SEQUENCES: 36 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE:- Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 23 

(D) OTHER INFORMATION: /note= "N = phosphate group" 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO ; 1: 

TGAGGTGCGT GTTTGTGCCT GTN 



(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Cys Gly Tyr Gly Pro Lys Lys Lys Arg Lys Val Gly Gly 
1 5 10 



(2) INFORMATION FOR SEQ ID NO : 3: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 
CTCCCAGGAC AGGCACAAAC ACGCACCTCA AAGCTGTTCC GT 



(2) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /note= "N = amino- thymidine 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
ATC TGAAGTC AGTAAANGAA C 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 13 

.(D) OTHER INFORMATION: /note= "N = amino- thymidine * 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATTTACCAGA GANCGAG 



(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
.(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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Cys Lys Asn Leu Asn Lys Asp Lys Gin Val Tyr Arg Ala Thr His Arg 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Thr Cys Val Glu Trp Leu Arg Arg Tyr Leu Lys Asn 
15 10 



\ *. / iiu viu-ifii. i'Viv uu^ xi-* »U ; o . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Cys Ser Arg Ala Arg Lys Gin Ala Ala Ser lie Lys Val Ser Ala Asp 
15 io 15 

Arg 



(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
* ji '^'i* 'A"!' 1 A 1 'A 1 'X* f X* T *j* ^ 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
. (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 
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(B) LOCATION: 15 

<D) OTHER INFORMATION: /note= "T is amino modified." 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGTTTACATG TTCCAATATG AT 22 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

<D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Cys Gly TiTx L Ax'g Gly Ser Gly Ly£ Gly Thr Gly 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
<B) TYPE: amino acid 

( C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ix> FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "Sulfonated cysteine" 

( ix ) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) . LOCATION: 19 

(D) OTHER INFORMATION: /note= "Sulfonated cysteine" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Phe Val Asn Gin His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 
1 5 10 15 

Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Ala 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ix) FEATURE: 
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(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "Sulfonated cysteine" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Phe Val Asn Gin His Leu Cys Gly Ser His Leu Val Glu 
15 io 



INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Arg Gly Phe Phe Tyr Thr Pro Lys Ala 
1 5 



INFORMATION FOR SEQ ID NO: 15: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATIONS 

(D) OTHER INFORMATION: /note= "Sulfonated cysteine" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ala Leu Tyr Leu Val Cys Gly Glu 
1 5 



INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : modi f ied_base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /not e= "Mass label attached to an 
amino-modif ied thymidine; chemically cleavable 
disulfide -containing group between T and G" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TGTGCTCAAG AACTACATGG 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17: 

TACTCCAGTT CCATGTAGTT CTTGAGCAC 



(2) INFORMATION FOR SEQ ID NO: 18: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TACTCCAGTA CCATGTAGTT CTTGAGCAC 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Cys Gly Arg Gly Ser Gly Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO : 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION : 1 

(D) OTHER INFORMATION: /note= "Mass label attached to an 
amino -modified thymidine; chemically cleavable 
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disulf ide-containing group between T and T n 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 20: 
TTCGGAGTCA ACGGATTTG 



(2) INFORMATION FOR SEQ ID NO : 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 21: 
TCCAGTTCTC AAATC CGTTG ACTCCGA 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs. 
'(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION :1 

(D) OTHER INFORMATION :/note= "Mass label attached to an 

amino -modified thymidine; chemically cleavable 
- disulf ide-containing group between T and G" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

TGATGTCTGT ATATGTTGCA CTG 



(2) INFORMATION FOR SEQ ID NO : 23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
AAGTTGACTC TCAGTGCAAC ATATACAGAC ATC 



(2) INFORMATION FOR SEQ ID NO : 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

( D ) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Cys Ala Gly Gly Arg Gly Gly Gly Lys Gly Gly Ala 
1 5 .10 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Cys Ala Ser Gly Arg Gly Ser Gly Lys Gly Ser Ala 
15 10 



(2) INFORMATION FOR SEQ ID NO : 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 

(A) NAME/ KEY : modif ied_base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /not e= "Mass label attached to an 
amino-modif ied thymidine; chemically cleavable 
disulfide -containing group between T and G ,f 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 26: 

TGTGCTCAAG AACTACATGA 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ix) FEATURE : 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 1 . 

(D) OTHER INFORMATION: /note= "Mass label attached to an 
amino-modif ied thymidine chemically cleavable 
disulfide -containing group between T and G" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TGTGCTCAAG AACTACATGT 2 0 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
TACT C CAG TT ACATGTAGTT CTTGAGCAC 2 9 



(2) INFORMATION FOR SEQ I-D -NQ : 29* 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Cys Ala Gly Gly Arg Gly Gly Gly Lys Gly Gly Ala 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 30: 

Cys Ala Ser Gly Arg Gly Ser Gly Lys Gly Ser Ala 
15 10 



(2) INFORMATION FOR SEQ ID NO : 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
.(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /note= "Cleavage site" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



CCTGGCAAAC . TCAACTAGGC TGTCC 



25 



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



GATCCGGACA GCCTAGTTGA GTTTGCCAGG TAAGA 



35 



.(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
TAATACGACT CACTATAGGG AGACTGCTGA GGATTGTAGA GC 42 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TCCAACAGTA TAGATCTCAT G 21 



(2) INFORMATION FOR SEQ ID NO : 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Cys Gly Tyr Gly Pro Lys Lys Lys Arg Lys Val Gly Gly 
15 10 



WO 98/26095 



100 



PCT/US97/22639 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



Cys Lys Asn Leu Asn Lys Asp Lys Gin Val Tyr Arg Ala Thr His Arg 
1 5 ^ 10 15 
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CLAIMS: 

A release tag compound comprising Rx, Re and M wherein: 
Rx is a reactive group; 
Re is a release group; and 

M is a nonvolatile mass label comprising a synthetic polymer or a biopolymer detectable 
by mass spectrometry. 

The compound of claim 1 , wherein the mass label comprises a biopolymer. 

The compound of claim 2, wherein the biopolymer comprises one or more monomer 
units, wherein each monomer unit is separately and independently selected from the 
group consisting essentially of an amino acid, a nucleic acid, and a saccharide. 

The compound of claim 3, wherein each monomer comprises an amino acid. 

The compound of claim 3, wherein each monomer comprises a nucleic acid. 

The compound of claim 1, wherein the mass label comprises a synthetic polymer. 

The compound of claim 6, wherein the synthetic polymer comprises polyethylene glycol, 
polyvinyl phenol, polypropylene glycol, polymethyl methacrylate, and derivatives 
thereof. 

The compound of claim 7, wherein the synthetic polymer comprises polyethylene glycol. 

The compound of claim 1, wherein the release group comprises a chemically cleavable 
linkage. 

The compound of claim 9, wherein the chemically cleavable linkage comprises a 
modified base, a modified sugar, a disulfide bond, a chemically cleavable group 
incorporated into the phosphate backbone, or a chemically cleavable linker. 
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1 1. The compound of claim 10, wherein the chemically cleavable linkage further comprises a 
moiety cleavable by acid, base, oxidation, reduction, heat, light, metal ion catalyzed, 
displacement, or elimination chemistry. 

2. The compound of claim 10', wherein the chemically cleavable group is incorporated into 
the phosphate backbone. 



3. The compound of claim 12, wherein the chemically cleavable group comprises 
dialkoxysilane, 3'-(S)-phosphorothioate, 5'-(S)-phosphorothioate, 3'-(N> 
phosphoroamidate, or 5'-(N)-phosphoroamidate. 

4. The compound of claim 10, wherein the chemically cleavable linkage comprises a 
modified sugar. 

5. The compound of claim 14, wherein the modified sugar comprises ribose. 

6. The compound of claim 10, wherein the chemically cleavable linkage comprises a 
disulfide bond. 

The compound of claim 1, wherein the reactive group comprises a biomolecule capable 
of specific molecular recognition. 

The compound of claim 17, wherein the biomolecule capable of specific molecular 
recognition comprises a polypeptide. 

The compound of claim 18, wherein the polypeptide is selected from the group consisting 
essentially of an antibody, an enzyme, a receptor, a regulatory protein, a nucleic acid- 
binding protein, a hormone, and a protein product of a display method. 

The compound of 19, wherein the polypeptide comprises a product of a phage display 
method or a bacterial display method. 

The compound of claim 19, wherein the polypeptide comprises an antibody or an 
enzyme. 
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22. The compound of claim 17, wherein the biomolecule capable of specific molecular 
recognition comprises a polynucleic acid. 

23. The compound of claim 22, wherein the polynucleic acid comprises an oligonucleotide. 

24. The compound of claim 1, wherein Rx and Re are the same. 

25. The compound of claim 1, wherein Re is contained within Rx. 

26. The compound of claim 1, wherein Re is cleavable by an enzyme. 

27. The compound of claim 26, wherein Re is a phosphodiester or amide linkage. 

28. The compound of claim 26, wherein the enzyme comprises a nuclease. 

29. The compound of claim 28, wherein the nuclease comprises an exonuclease. 

30. The compound of 29, wherein the exonuclease is specific for double-stranded polynucleic 
acids. 

31. The compound of claim 29, wherein the exonuclease is specific for single-stranded 
polynucleic acids. 

32. The compound of claim 28, wherein the nuclease comprises a restriction endonuclease. 

33. The compound of claim 32, wherein the restriction endonuclease comprises a Type IIS 
restriction endonuclease. 

34. The compound of claim 32, wherein the restriction endonuclease comprises a Type II 
. restriction endonuclease. 

35. The-compound of claim 27, wherein the enzyme comprises a protease. 

36. The compound of claim 35, wherein the protease comprises an endoproteinase. 
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37. The compound of claim 1, wherein more than one mass label is incorporated. 



38. The compound of claim 1, wherein the mass label has a molecular weight greater than 
about 500 Daltons. 

39. A release tag compound comprising Rx, Re and M, wherein: 

Rx is a reactive group synthesized using mass-labeled nucleoside triphosphates; 
Re is a release group; and 

M is a mass label detectable by mass spectrometry. 

40. , A release tag compound comprising Rx, Re and M 3 wherein: 

Rx is a reactive group synthesized using mass-labeled nucleoside phosphoramidites; 
Re is a release group; and 

M is a mass label detectable by mass spectrometry. 

41. A release tag compound comprising Rx, Re and M, wherein: 

Rx is a reactive group comprising a first oligonucleotide having a nucleotide or a second 
oligonucleotide attached thereto; 
Re is a release group; and 

M is a mass label detectable by mass spectrometry; and 
wherein said nucleotide or second oligonucleotide is added after hybridizing said first 
oligonucleotide to a complementary nucleic acid sequence. 

42. The compound of claim 41, wherein the nucleotide added after hybridization comprises a 
chain terminating modification. 

43. The compound of claim 41, wherein the nucleotide or second oligonucleotide further 
comprise a functional group capable of being immobilized on a solid support. 

44. The compound of claim 43, wherein the functional group comprises a biotin, or 
digoxigenin. 

45. A release tag compound comprising Rx, Re and M,wherein: 
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Rx is an oligonucleotide comprising a nuclease blocking moiety; 
Re is a release group; and 

M is a mass label detectable by mass spectrometry. 

46. The compound of claim 45, wherein the nuclease blocking moiety is selected from the 
group consisting essentially of a phosphorothioate, an alkylsilyldiester, a 
boranophosphate, a methylphosphonate, and a peptide nucleic acid. 

47. A release tag compound comprising Rx, Re and M, wherein: 

Rx is a double-stranded oligonucleotide comprising a restriction endonuclease 
recognition site; 

Re is a release group comprising a phosphodiester linkage capable of being cleaved by a 

restriction endonuclease; and 

M is a mass label detectable by mass spectrometry. 

48. The compound of claim 47, wherein Rx further comprises a modified nucleotide. 

49. The compound of claim 48, wherein M comprises a portion of Rx. 

50. The compound of claim 47, wherein Rx further comprises a self-complementary 
oligonucleotide hairpin. 

51 . A release tag compound comprising Rx, Re and M, wherein: 
Rx is a double-stranded oligonucleotide; 

Re is a chemically cleavable release group; and 

M is a mass label detectable by mass spectrometry; 
wherein Re is located within Rx; and wherein cleavage at the chemically cleavable release group 
is inhibited by the presence of a double- stranded oligonucleotide at said release group. 



52. 



The compound of claim 51, wherein the chemically cleavable release group comprises 3- 
(S)-phosphorothioate, 5'-(S)-phosphorothioate, 3 '-CN)-phosphoroamidate, 5 f -(N)- 
phosphoroamidate, or ribose. 
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53. The compound of claim 51, wherein hybridization of a portion of Rx to a target nucleic 
acid renders a portion of Rx single-stranded at Re. 

54. A set of release tag compounds, said set comprising one or more release tag compounds 
for detecting a particular target nucleic acid, each release tag compound comprising Rx, Re and 
M, wherein: 

Rx is an oligonucleotide comprising a variable region and an invariant region; 
Re is a release group; and 

M is a mass label detectable by mass spectrometry; and 
wherein said invariant and variable regions react with the target nucleic acid. 

55. The set of claim 54, wherein the mass label of at least one release tag compound 
identifies a specific sequence within the variable region. 

56. The set of claim 55, wherein the mass label for each release tag compound uniquely 
identifies a different sequence within the variable region. 

57. The set of claim 54, wherein a combination of the mass labels of two or more release tag 
compounds identifies a different sequence within the variable region. 

58. The set of claim 54, wherein Rx further comprises a nucleotide or oligonucleotide 
attached thereto after hybridization to the target nucleic acid. 

59. The set of claim 58, wherein the added nucleotide or oligonucleotide comprises Re 1 and 
M', wherein: 

Re' is a release group; and 

M' is a mass label detectable by mass spectrometry. 

60. The set of claim 58, wherein the added nucleotide comprises a chain terminating moiety. 



61. 



The set of claim 58, wherein the added nucleotide or oligonucleotide further comprises a 
functional group capable of being immobilized on a solid support. 
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62. The set of claim 61, wherein the functional group comprises a biotin or digoxigenin. 



63. A method for detecting a target molecule, said method comprising the steps of: 

(a) obtaining a target molecule; 

(b) amplifying the target molecule to produce an amplified target molecule; 

(c) obtaining a probe comprising a reactive group, a release group and a mass label; 

(d) hybridizing the amplified target molecule to the probe to produce a 
probe:ampliried target molecule complex; 

(e) releasing the mass label from the probe:amplified target molecule complex to 
obtain a released mass label; and 

(d) determining the mass of the released mass label by mass spectrometry. 

64. The method of claim 63, wherein the amplified target molecule comprises a functional 
group capable of being immobilized on a solid support. 

65. The method of claim 64, wherein the functional group comprises a biotin or digoxigenin. 

66. The method of claim 64, wherein the functional group is attached to an oligonucleotide 
primer incorporated into the amplified target molecule during the amplification step. 

67. The method of claim 64, wherein the functional group is attached to a nucleotide 
incorporated into the amplified target molecule during the amplification step. 

68. The method of claim 64, wherein the amplified target molecule is immobilized onto the 
solid support and any probe not part of a probe: amplified target molecule complex is 
removed by washing. 



69. 



The method of claim 63, wherein the mass label is released by an enzyme. 



70, 
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The method of claim 69, wherein the enzyme comprises a nuclease. 



71. The method of claim 70, wherein the nuclease comprises a restriction endonuclease. 

72. The method of claim 70, wherein the nuclease comprises an exonuclease. 

73. The method of 72, wherein the exonuclease is specific for double-stranded DNA. 

74. The method of claim 73, wherein the exonuclease is selected from the group consisting 
essentially of exonuclease III, T4 endonuclease VII, lambda exonuclease, and DNA 
polymerase. 

75. The method of claim 73, wherein the release of the mass label is triggered by the 
hybridization of the probe to the amplified target molecule. 

76. The method of claim 72, wherein the exonuclease is specific for single-stranded DNA. 

77. The method of claim 63, wherein the release group comprises a chemically cleavable 
linkage. 

78. The method of claim 77, wherein the chemically cleavable linkage comprises a modified 
base, a modified sugar, a disulfide bond, a chemically cleavable group incorporated into 
the phosphate backbone, or a chemically cleavable linker. 

79. The method of claim 78, wherein the chemically cleavable linkage further comprises a 
moiety cleavable by acid, base, oxidation, reduction, heat, light, metal ion catalyzed, 
displacement, or elimination chemistry. 

80. The method of claim 79, wherein the chemically cleavable linkage comprises a disulfide 
bond. 

81. The method of claim 63, wherein the reactive group further comprises a nucleotide or 
oligonucleotide added after hybridization of the probe, to the amplified target molecule. 
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82. The method of claim 81, wherein the added nucleotide or oligonucleotide further 
comprises a functional group capable of being immobilized on a solid support. 

83. The method of claim 82, wherein the nucleotide is added by a polymerase. 

84. The method of claim 82, wherein the oligonucleotide is added by a Iigase. 

85. The method of claim 82 further comprising: 

(a) immobilizing the reactive group onto the solid support after addition of the 
nucleotide or oligonucleotide; and 

(b) removing any probes having unbound reactive groups prior to releasing the mass 
label of any probe having a bound reactive group. 

86. The method of claim 63, wherein the reactive and release groups are the same. 

87. The method of claim 63, wherein the release group is contained within the reactive group. 

88. The method of claim 63, wherein the probe comprises at least two mass labels having 
different masses. 

89. The method of claim 63, wherein the target molecule is contacted with a plurality of 
probes. 

90. The method of claim 89, wherein each reactive group is associated with a unique mass 
label. 

91. The method of claim 90, wherein each reactive group is associated with a unique set of 
mass labels. 

92. The method of claim 91, wherein the set of mass labels are attached to the same probe. 
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93. The method of claim 91, wherein each member of the set of mass labels is attached to a 
different probe. 

94. The method of claim 63, wherein the amplified target molecule comprises a double- 
stranded molecule, each strand having a 3' end and a 5' end, said double -stranded 
molecule containing a mismatch and the 3* ends are not capable of being digested by an 
exonuclease. 

95. The method of claim 94, further comprising 

cleaving at least one strand of the double-stranded molecule at the mismatch; and 
selectively releasing the mass label by digesting the cleaved strand with a 3' to 5' 
exonuclease. 



96. The method of claim 95, wherein the mismatch is cleaved by an enzyme. 

97. The method of claim 96, wherein the enzyme comprises mutHLS, T4 endonuclease VII, 
mutY DNA glycosylase, thymine mismatch DNA glycosylase, or endonuclease V. 

98. The method of claim 95, wherein the mismatch is cleaved by a chemical. 

99. The method of claim 98, wherein the chemical comprises Os0 4 , HONH 2 , or KMn0 4 . 

100. The method of claim 95, wherein the 3' to 5' exonuclease comprises exonuclease III. 

101 . A method for detecting a target molecule, said method comprising the steps of: 

(a) obtaining a probe comprising a reactive group, a release group and a nonvolatile 
mass label; 

(b) obtaining a target molecule; 

(c) contacting the target molecule with the probe to produce a probe:target molecule 
complex; 

(d) selectively releasing the mass label from the probeitarget molecule complex; and 

(e) determining the mass of the mass label by mass spectrometry. 
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102. The method of claim 101, wherein the mass label is selectively released by an enzyme. 

103. The method of claim 102, wherein the enzyme comprises a nuclease. 

104. The method of claim 103, wherein the enzyme comprises a restriction endonuclease. 

105. The method of claim 104, wherein the enzyme comprises an exonuclease. 

106. The method of 105, wherein the exonuclease is specific for double- stranded DNA. 

107. The method of claim 106, wherein the selective release by the exonuclease is triggered by 
the hybridization of the probe to the target molecule. 

108. The method of claim 101, wherein the release group is located within reactive group and 
the reactive group is an oligonucleotide. 

109. The method of claim 108, wherein the selective release of the mass label is inhibited by 
the presence of a double-stranded oligonucleotide at said release group. 

110. The method of claim 109, wherein contacting the probe with the target molecule results 
in the release group being present in a single-stranded region. 

111. The method of claim 110, wherein the release group comprises a chemically cleavable 
release group. 

1 12. The method of claim 111, wherein the chemically cleavable release group comprises 3'- 
(S)-phosphorothioate, 5'-(S)-phosphorothioate, 3 , -(N)-phosphoroamidate, 5*-(N)- 
phosphoroamidate, or ribose. 

113. The method of claim 110, wherein the release group is cleavable by a single-strand- 
specific nuclease. 

114. The method of claim 101, wherein the reactive group comprises a polynucleotide or an 
oligonucleotide. 
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1 15. The method of claim 1 14, wherein the reactive group further comprises a nucleotide or an 
oligonucleotide added after hybridization to the target molecule. 

1 16. The method of claim 1 15, wherein the nucleotide is added by a polymerase. 

117. The method of claim 1 1 5, wherein the oligonucleotide is added by a ligase. 

118. The method of claim 1 15, wherein the nucleotide or oligonucleotide further comprises a 
functional group capable of being immobilized on a solid support. 

119. The method of claim 118, wherein the functional group comprises a biotin or 
digoxigenin. 

120. The method of claim 118 further comprising: 

(a) immobilizing the reactive group onto the solid support after the addition of the 
nucleotide or oligonucleotide; and 

(b) removing any probes having unbound reactive groups prior to releasing their mass 
labels. 

121 . A method for multiplexing the detection of a target molecule comprising: 

(a) obtaining a plurality of probes, each comprising a reactive group, a release group 
and a mass label; 

(b) contacting the target molecule with the plurality of probes to produce probe:target 
molecule complexes, wherein the target molecule is attached to the reactive group 
of the probe; 



(c) 



releasing the mass labels from the probe:target molecule complexes to produce 
released mass labels; and 
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(d) determining the mass of the released mass labels by mass spectrometry, wherein 
each reactive group in a probe:target molecule complex is associated with a 
unique set of mass labels. 

122. The method of claim 121, wherein a plurality of target molecules is contacted with the 
plurality of probes. 

1 23. The method of claim 121, wherein the set of mass labels are attached to the same probe. 

124. The method of claim 121 , wherein the set of mass labels are attached to different probes. 

125. The method of claim 121, wherein the target molecule is immobilized onto a solid 
support. 

126. The method of claim 124, wherein a plurality of target molecules are immobilized onto 
the solid support at spaced locations. 

127. The method of claim 121, wherein the target molecule is selected from the group 
consisting essentially of a polynucleotide, an antigen, a ligand, a polypeptide, a 
carbohydrate, and a lipid. 

128. The method of claim 127, wherein the target molecule comprises a polynucleotide. 

129. The method of claim 127, wherein the target molecule comprises a polypeptide. 

130. The method of claim 121, wherein the reactive group comprises a biomolecule capable of 
specific molecular recognition. 

131 . The method of claim 130, wherein the reactive group comprises a polypeptide. 



132. 



The method of claim 131, wherein the polypeptide is selected from the group consisting 
essentially of an antibody, an enzyme, a receptor, a regulatory protein, a nucleic-acid- 
binding protein, a hormone, or a protein product of a display methods. 
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133. The method of claim 132, wherein the polypeptide comprises an antibody. 

134. The method of claim 132, wherein the polypeptide comprises a product of a phage 
display method or a bacterial display method. 

135. The method of claim 130, wherein the reactive group comprises a polynucleotide or an 
oligonucleotide. 

136. The method of claim 121, wherein the unique set of mass labels comprises a mass label 
that indicates the presence of a specified component within the reactive group. 

137. The method of claim 136, wherein the mass label indicates the presence of the specified 
component at a specified location within the reactive group. 

138. The method of claim 137, wherein a reactive group comprising n specified components is 
associated with a unique set of mass labels having n members. 

139. The method of claim 138, wherein n is from 1 to 1000. 

140. The method of claim 137, wherein a reactive group comprising n specified components is 
associated with a unique set of mass labels having y members wherein n is less than 
y!/[x!(y-x)!]; and 

wherein x comprises the number of mass labels used per reactive group. 

141. The method of claim 137, wherein the plurality of probes each comprise a known reactive 
group having a known set of mass labels. 

142. The method of claim 141, wherein the plurality of probes are prepared by combinatorial 
synthesis. 



143. 



The method of claim 137, wherein the plurality of target molecules comprise a known 
chemical structure. 
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144. The method of claim 136, wherein the target molecule comprises an expressed gene 
product. 

145. The method of claim 144, wherein the expressed gene product is derived from a cloned 
mRNA. 

146. The method of claim 136, wherein the target molecule comprises a cloned genomic DNA. 

147. A method of monitoring gene expression comprising: 

(a) obtaining a plurality of probes, each probe comprising a reactive group, a release 
group and a mass label; 

(b) contacting a plurality of target nucleic acids with the plurality of probes to 
produce probe:target nucleic acid complexes; 

(c) selectively releasing the mass labels from the probe:target nucleic acid complexes, 
wherein the complexes are in solution; and 

(d) determining the mass of the released mass labels by mass spectrometry. 

148. The method of claim 147, wherein the target nucleic acids comprise mRNA or first- 
strand cDNA. 

149. The method of claim 147, wherein the target nucleic acids comprise amplified nucleic 
acid products. 

150. The method of claim 149, wherein the amplified nucleic acid products are produced using 
PCR, rtPCR, LCR, Qbeta Replicase, SDA, CPR, TAS, NASBA, or multiple rounds of 
RNA transcription or some combination thereof. 



151. 



A method of monitoring gene expression comprising: 
(a) obtaining an mRNA pool; 
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(b) amplifying a subset of the mRNA pool to produce a plurality of amplified nucleic 
acid products 

(c) obtaining a plurality of probes, each probe comprising a reactive group, a release 
group and a mass label; 

(d) contacting the plurality of amplified nucleic acid products with the plurality of 
probes to produce probe :amplified nucleic acid product complexes; 

(e) selectively releasing the mass label from the probe:amplified nucleic acid product 
complexes to produce released mass labels; and 

. (f) determining the mass of the released mass labels by mass spectrometry. 

152. The method of claim 151, wherein at least one probe is capable of being immobilized 
onto a solid support. 

153. The method of claim 151, wherein at least one amplified nucleic acid product is capable 
of being immobilized onto a solid support. 

154. A method for detecting a target molecule, said method comprising the steps of: 

(a) contacting a target molecule with a probe comprising a reactive group, a release 
group and a nonvolatile mass label to produce probe:target molecule complexes 
and unreacted probes; 

(b) releasing the mass labels from the probertarget molecule complexes to produce 
released mass labels; 

(c) selectively desorbing the released mass label from an organic matrix to produce 
desorbed mass labels; and 

(d) determining the mass of the desorbed mass label by mass spectrometry. 
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155. The method of claim 154, wherein the organic matrix comprises 2,5-dihydroxybenzoic 
acid, sinapinic acid, or alpha-cyano-4-hydroxycinnamic acid. 

156. A method for detecting a target molecule, said method comprising the steps of: 

(a) amplifying a target nucleic acid to produce amplified nucleic acid products; 

(b) obtaining one or more first molecules, said first molecules comprising a reactive 
group, a release group and a nonvolatile mass label; 

(c) incorporating said first molecules into the amplified nucleic acid product during 
the amplification process to produce incorporated mass labeled molecules and 
unincorporated mass labeled molecules; 

(d) releasing the mass labels incorporated into the amplified nucleic acid products to 
produce released mass labels; and 

(e) determining the mass of the released mass labels by mass spectrometry. 

157. The method of claim 156, wherein the molecules are oligonucleotide primers. 

158. The method of claim 156, wherein the molecules are nucleoside triphosphates. 

159. The method of claim 156, wherein the amplified nucleic acid products are produced using 
PGR, rtPCR, LCR, Qbeta Replicase, SDA, CPR, TAS, NASBA, or multiple rounds of 
RNA transcription or some combination thereof 

160. The method of claim 159, wherein the amplified nucleic products are produced using 
PCR or rtPCR. 

161. The method of claim 156, wherein the mass label is released by an enzyme. 



162. The method of claim 161, wherein the enzyme comprises a nuclease. 
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163. The method of claim 162, wherein the nuclease comprises a restriction endonuclease. 

164. The method of claim 163, wherein the restriction endonuclease comprises a Type IIS 
restriction endonuclease. 

165. The method of claim 163, wherein the restriction endonuclease comprises a Type II 
restriction endonuclease. 

166. The method of claim 162, wherein the nuclease comprises an exonuclease. 

167. The method of claim 166, wherein the exonuclease is specific for double-stranded DNA. 

168. The method of claim 167, wherein the exonuclease is selected from a group consisting of 
exonuclease III, T4 endonuclease VII, lambda exonuclease, and DNA polymerase. 

169. The method of claim 156, wherein the release group comprises a chemically cleavable 
linkage. 

170. The method of claim 169, wherein the chemically cleavable linkage comprises a modified 
base, a modified sugar, a disulfide bond, a chemically cleavable group incorporated into 
the phosphate backbone , or a chemically cleavable linker. 

171. The method of claim 170, wherein the chemically cleavable linkage further comprises a 
moiety cleavable by. acid, base, oxidation, reduction, heat, light, metal ion catalyzed, 
displacement or elimination chemistry. 

172. The method of claim 171, wherein the chemically cleavable group comprises a 
chemically cleavable group incorporated into the phosphate backbone. 



173. 



The method of claim 172, wherein the chemically cleavable group comprises 
dialkoxysilane, 3'-(S)-phosphorothioate, 5'-(S)-phosphorothioate, 3'-(N)- 
phosphoroamidate, or 5'-(N)-phosphoroamidate. 



WO 98/26095 ^ PCT/US97/22639 

174. The method of claim 171, wherein the chemically cleavable linkage comprises a modified 
sugar. 

175. The method of claim 174, wherein the modified sugar comprises ribose. 

176. The method of claim 171, wherein the chemically cleavable linkage comprises a disulfide 
bond. 

177. The method of claim 156, wherein one or more second molecules are incorporated into 
the amplification nucleic acid products, said second molecules comprising a functional 
group capable of being immobilized on a solid support,. 

178. The method of claim 177, wherein the functional group comprises a biotin or 
digoxigenin. 

1 79. The method of claim 1 77, wherein the functional group comprises a linker molecule 
capable of forming a covalent linkage to a solid support. 

180. The method of claim 177, wherein the second molecules are oligonucleotide primers. 

181. The method of claim 177, wherein the second molecules are nucleoside triphosphates. 

182. The method of claim 177, wherein the functional group binds the amplified nucleic acid 
products to a solid support, and separate incorporated mass labeled molecules from 
unincorporated mass labeled molecules. 

183. The method of claim 156, wherein the amplified nucleic acid products are separated from 
unincorporated mass labeled molecules. 

184. The method of claim 183, comprising binding the amplified nucleic acid products to a 
solid support. 



185. 



The method of claim 183, comprising hybridizing the amplified nucleic acid products to a 
polynucleotide bound to a solid support. 
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186. The method of claim 185, wherein the bound polynucleotide is an oligonucleotide, a 
polyribonucleotide, a plasmid, an M13, a cosmid, a PI clone, a BAC or a YAC. 

187. The method of claim 185, comprising hybridizing the amplified nucleic acid products to a 
plurality of polynucleotides immobilized onto the solid support at spaced locations. 

188. A method for detecting the presence of a target nucleic acid molecule, said method 
comprising: 

(a) obtaining a probe comprising a reactive group, a release group and a mass label; 

(b) contacting the probe to a target nucleic acid molecule to produce probe:nucleic 
acid molecule complexes; 

(c) mass modifying the probernucleic acid molecule complexes by attaching a 
nucleotide or oligonucleotide to the probe to produce mass modified mass labels; 

(d) releasing the mass modified mass labels; and 

(e) determining the mass of the mass-modified mass labels by mass spectrometry. 

189. A method for detecting specific biomolecules in an enzyme-linked affinity assay 
comprising 

(a) obtaining a substrate; 

(b) contacting a target molecule with an affinity ligand-enzyme conjugate to produce 
an affinity ligand-enzyme conjugate rtarget molecule complex; 

(d) contacting the affinity ligand-enzyme conjugate:target molecule complex with the 
substrate to produce a mass modified product; and 

(e) . determining the mass of the mass modified product by mass spectrometry. 



190. 



The method of claim 189, wherein the enzyme is a restriction endonuclease. 
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191. The method of claim 190, further comprising a plurality of restriction endonuclease 
conjugates. 

192. The method of claim 190, wherein the substrate comprises a restriction site. 

193. The method of claim 189, wherein the affinity ligand is a biomolecule capable of specific 
molecular recognition. 

194. The method of claim 193, wherein the affinity ligand is a polypeptide. 

195. The method of claim 194, wherein the polypeptide is selected from the group consisting 
essentially of an antibody, an enzyme, a receptor, a regulatory protein, a nucleic acid- 
binding protein, a hormone, or a protein product of a display method. 

196. The method of claim 195, wherein the polypeptide comprises a product of a phage 
display method or a bacterial display method. 

197. The method of claim 194, wherein the polypeptide comprises an antibody. 

198. The method of claim 193, wherein the affinity ligand is a polynucleic acid. 

1 99. The method of claim 1 89, wherein the enzyme is a protease. 

200. The method of claim 1 99, wherein the substrate is a polypeptide. 
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